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Abstract 

It appeared recently that the underlying degree distribution of networks may 
play a crucial role concerning their robustness. Empiric and analytic results have 
been obtained, based on asymptotic and mean-field approximations. Previous work 
insisted on the fact that power-law degree distributions induce high resilience to 
random failure but high sensitivity to attack strategies, while Poisson degree distri- 
butions are quite sensitive in both cases. Then, much work has been done to extend 
these results. 

We aim here at studying in depth these results, their origin, and limitations. We 
review in detail previous contributions and give full proofs in a unified framework, 
and identify the approximations on which these results rely. We then present new 
results aimed at enlightening some important aspects. We also provide extensive 
rigorous experiments which help evaluate the relevance of the analytic results. 

We reach the conclusion that, even if the basic results of the field are clearly 
true and important, they are in practice much less striking than generally thought. 
The differences between random failures and attacks are not so huge and can be 
explained with simple facts. Likewise, the differences in the behaviors induced by 
power-law and Poisson distributions are not as striking as often claimed. 

Categories and Subject Descriptors: A.l [Introductory and Survey]; C.2.1 [Com- 
puter-Communication Networks]: Network Architecture and Design - Network 
topology; G.2.2 [Discrete Mathematics]: Graph Theory - Network Problems; 

General Terms: Experimentation, Reliability, Security 



Introduction. 

It has been shown recently, see for instance (5j HU [97J 11121 I120j . that most real- world 
complex networks have non-trivial properties in common. In particular, the degree distri- 
bution (probability pk that a randomly chosen node has k links, for each k) of most real- 
world complex networks is heterogeneous and well fitted by a power-law, i.e. p^ ~ k~ a , 
with an exponent a between 2 and 3 in general. This property has been observed in 
many cases, including internet and web graphs pEl ESI UHl Ell El HDSJ HU [1221 
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[271 [1231 [231 El U EH M> El, social networks [831 E21 [931 EH HO] , and biological networks 
[731 [TTT[ 1691 150]. 

In most of these cases, the existence of a path in the network from most nodes to 
most others, called connectivity, is a crucial feature. For instance, in the case of the 
internet, it means that computers can communicate; in the case of the web if means that 
one may reach most pages from most others by following hyperlinks; and in the case 
of social networks it conditions the ability of information and diseases to spread. Note 
that connectivity may be a desirable feature (in the case of the internet for instance), 
or an unwanted one (in the case of virus propagation, for instance), depending on the 
application under focus. 

Networks are subject to damages (either accidental or not) which may affect connec- 
tivity. For instance, failures may occur on computers on the internet, causing removal of 
nodes in internet and web graphs. Likewise, in social networks, people can die from a 
disease, or people deemed likely to propagate the disease can be vaccinated, which cor- 
responds to node removals. For the study of these phenomena, accidental failures may 
be modeled by removals of random nodes and/or links in the considered network, while 
attacks may be modeled by removals following a given strategy. 

Networks of different natures may behave differently when one removes nodes and/or 
links. The choice of the removed nodes or links may also influence significantly the 
obtained behavior. It has been confirmed that this is indeed the case in the famous 
paper [7], in which the authors consider networks with Poisson and power-law degree 
distributions!!, and then remove nodes either randomly (failures) or in decreasing order 
of their degree (attacks). They measure the size of the largest connected component (i.e. 
the largest set of nodes such that there is a path in the network from any node to any 
other node of the set) as a function of the fraction of removed nodes. 

The authors of [20] had pursued the same kind of idea earlier. They tried to establish 
whether the connectivity of the web is mainly due to the (very popular) pages with a very 
large number of incoming links by studying the connectivity of the web graph from which 
the links going to these pages have been removed. 

The authors of [7J obtained the results presented in Figure [lj which shows two things: 
the removal strategies play an important role, and the two kinds of networks behave 
significantly differently: in particular, it seems that networks with power-law degree dis- 
tributions are very resilient to random failures, but very sensitive to attacks. This par- 
ticularity is now referred as the Achille's heel of the internet [911 [TO] . In the case of a 
social network on which one wants to design vaccination strategies, it means that one may 
expect better efficiency by vaccinating people with the highest number of acquaintances 
than with random vaccination |4"T1 11041 [331 ES] ■ 

2 More specifically, they considered Erdos-Renyi random graphs and graphs obtained with the pref- 
erential attachment model [4], which are not representative of all networks with Poisson or power- law 
degree distributions. In particular networks obtained with the preferential attachment model are not 
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Figure 1: Size of the largest connected component as a function of the fraction of randomly 
removed nodes (failures) and nodes removed in decreasing order of node degrees (classical 
attacks). We will define properly these different kinds of networks in Section [L2l For 
technical details on our plots see Section II .41 



Since then, much work has been done to extend this initial result. Other kinds of 
failures and attacks, in particular cascade ones, as well as other kinds of topologies, have 
been studied. See for instance [S3 EE! [13 Ell E21 CED EE! E7J E21 EDI ESI ES QEH1 CCS] • 
Some studies introduced other criteria for measuring the state of the network, see for 
instance [T8"l [371 |22"1 1101] ■ Cases where the underlying networks have non-trivial degree 
correlations have also been studied, see for instance [XT ] 1119] , Recent studies focused on 
the identification and design of robust topologies, or repair strategies, see for instance [1181 
H071 [TUm [TT51 EH ESI E51 E3 G23 Eni ESI H3l CH] . More details are given in section IT51 

Important efforts have been made to give analytic results based on mean-field approx- 
imations completing the experimental ones, see in particular [301 ED EH [25], EH [5]. Our 
aim is to present these results in detail, to deepen them with new results, and to discuss 
their implications. 

A significant part of this paper is therefore devoted to detailing the existing proofs of 
previously known results. Indeed, these proofs rely strongly on mean-field approximations 
which are classical in statistical mechanics, but quite unusual in computer science. They 
are therefore only sketched in the original papers, and many approximations are made 
implicitly. Therefore we deem in important to give proofs with full details and explicit 
approximations. 

The original papers moreover focus on specific cases of interest. We give here a unified 
and complete view of the questions under concern, including some new results, which 
makes it possible to deepen significantly our understanding of the field. In particular, 
we give results concerning failures and attacks on links, as well as results on finite cases, 
which have received little attention. We also compare the two different approaches for 
the study of power-law networks proposed in [2U EDI El] and in [25, 96J. 

equivalent to the random power-law networks we consider in this paper, see Section [1.21 
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This paper may therefore be considered as an in-depth and didactic survey of the 
main current results of the field, with the aim of unifying the different approaches and 
questions that have been studied, which leads to the introduction of some new results. 

This paper is organized as follows. Section [1] is devoted to preliminaries, which consist 
of definitions and models, of methodological discussions, and of some preliminary results. 
In particular, the approximations made in the proofs in this paper are presented and 
explained. Sections [2] and [3] deal with failures and attacks, respectively. We present 
classical results of the field, as well as several new results which aim at improving our 
understanding of the phenomena under concern. We discuss the behavior of several real- 
world complex networks in Section HI and compare them to expected behaviors from 
theory. Finally, we give an in-depth discussion and synthesis of our understanding of the 
field in Section [5j 

1 Preliminaries. 

Before entering the core of the paper, we need some important preliminaries. They consist 
of preliminary definitions and results, mainly concerning probability distributions and the 
models we will consider, but also of methodological aspects. This section should be read 
carefully before the rest of the paper since important notions are introduced and discussed 
here. 

Let us insist on the fact that most results in this paper are obtained using approxima- 
tions, aimed at simplifying the computation. These approximations are valid in the limit 
of networks with large sizes. They typically consist of neglecting the difference between 
iV and N ±n when n is small compared to N, or of supposing that random values are 
equal to the average. More subtle approximations are also done, belonging to the mean- 
field approximation framework, classical in statistical mechanics and widely used in the 
context of complex networks [4"0l [Q [99] . 

It is important to understand that the proofs we provide are valid only in this frame- 
work, as we have no formal guarantee that all approximations are valid. This is why we 
will always explicitly point out the approximations we make, and we will always compare 
analytic results to experiments. Moreover, we believe that efforts should be made to ob- 
tain exact results and proofs in this context: most results presented here are currently 
beyond the areas to which exact methods have been applied with success. Presenting 
exact results is however out of the scope of this paper. 

1.1 Poisson and power-law distributions. 

A probability distribution is given by the probability pk, for all k, that the considered 
value is equal to k. The sum of all pu must be equal to 1. A Poisson distribution 

k 

is characterized by pt = e~ z jr, where z is the average value of the distribution. The 
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probability of occurrence of a value x in such a distribution therefore decays exponentially 
with its difference \z — x\ to the average, which means that, in practice, all the values are 
close to this average. 

A power- law distribution with exponent a is such that pk is proportional k~ a . In such 
distributions, the probability of occurrence of a value x decays only polynomially with x. 
This implies that, though most values are small, one may obtain very large values. In the 
whole paper, we will generally consider exponents between 2 and 4, which are the relevant 
cases in our context (see Section [L2l . but we will also state some results valid out of this 
range. 

We will consider here two types of power-law distributions, which are the most widely 
used in the literature: discrete and continuous power-law distributions. They are both 
defined by their exponent a and their minimal value m. 

The corresponding continuous power-law is a Pareto distribution, such that Cx~ a dx = 
1. C is a normalization constant that we can compute: f°° C x~ a dx = C m = 1. 
We then obtain C = m a_1 (a — 1). To obtain discrete values, we then take pk equal to 
jk+i (j x -a^ x ^ j s proportional to k~ a in the limit where k is large|f|. And finally, 

Pk = m a ~ 1 (a — 1) J^ +1 x~ a dx = m a_1 (/c~ a+1 — (k + l)~ a+1 ). We will mainly use this form 
in the sequel but at some points we will switch back to the continuous form. 

The corresponding discrete power- law distribution is pk = ^ k~ a , k > m, where 
C = J2T=m k~ a * s the normalization constant necessary to ensure that each pk is between 
and 1 and that their sum is 1. In such a distribution, therefore, pk is exactly proportional 
to k~ a for all k > m. In order to simplify the computation, we will always take m = 1 
for discrete power-law distributions in this paper. This implies that C = C( a ) ; where ( is 
the Riemann zeta function defined for a > 1 by C( a ) = Y2T=i ^ Then, pk = k~ a . 

Discrete and continuous power-law distributions each have their own advantages and 
drawbacks. For instance, continuous power-laws are easier to use in experiments than 
discrete ones, which themselves are more rigorous than continuous ones for small values. 
For a more complete discussion on the advantages and drawbacks of discrete and contin- 
uous distributions, see for instance [321 E2]- We will use both of them in the sequel, and 
discuss their differences. 

Bounded distributions 

Given a distribution pk as defined above, one may sample a finite number of values 
from it. In such a sample, there is a maximal value K. Therefore, the actual distribution 
of the values in this sample, i.e. the fraction Pk{N) of values equal to k for each k, is 
slightly different from the original distribution pk- In particular, for all k > K, Pk(N) = 
(while in general pk > 0). We will therefore call these distributions bounded distributions. 

3 One can also define pk to be proportional to x~ a dx, see [35]. This has little impact on the 

obtained results. 
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The difference between bounded and unbounded distributions goes to zero when N tends 
towards infinity, but for any finite value of N it may play a role in our observations. 

We detail below important properties of bounded distributions, starting with their 
expected maximal value K. 

The maximal value K among a sample of iV values from a given distribution pk is a 
random variable, and it is possible to give the exact formula for its expected value: let 
Xi, . . . ,X N be the values sampled from the distribution, and let Y = maXj =1 .jvJQ. Y 
then has the following distribution: 

P(Y = K) = (£pm) N ~ (f>)"- 

k=0 k=0 

It is the probability that all values are lesser than or equal to K, minus the probability 
that all values are lesser than K, i.e. the probability that all values are lesser than or 
equal to K and at least one is equal to K, and its expected value is given by: 

oo 

E[Y] = J2 kp ( Y = k )- 

k=0 

However, deriving numerical values from this formula is too intricate. We will therefore 
use an approximation: 

Lemma 1.1 J3&j For a given distribution pk such that Pk> for all k, the expected 
maximal value K among a sample of N values can be approximated by 

k-x 1 

Proof : The claim is equivalent to J2k Pk — ^ 5 which means that K is such that there 
is only one value larger than K in the sample. Moreover this value must be exactly equal 
to K, otherwise there would be only one value larger than K + 1 and we would have 
Yl^K+iVk = jr, which is impossible since Pk > for all k. □ 

We can apply this result to the three cases of interest: 

Lemma 1.2 For a Poisson distribution with average value z, the expected maximal value 
K among a sample of N values can be approximated by 
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Proof : Direct application of Lemma [1.11 with Pk = e 2 -|j. 



□ 



Lemma 1.3 fffO]/ For a continuous power-law with exponent a and minimal value m, the 

expected maximal value K among a sample of N values can be approximated by K = 
i 

Proof : From Lemma [LTl K satisfies Yli" 1 Pk = 1 — ■h- Therefore, 4p = YIk Pk- We have 
thatp/c = (a-l)m 0_1 x~ a dx. Therefore, -h = (a — l)m a ~ l x~ a dx = m a ~ 1 K~ a+x . 
The result follows directly. □ 



Lemma 1.4 For a discrete power-law with exponent a, the expected maximal value K 
among a sample of N values can be approximated by ((a)(l — j?) = H^]_ 1} where = 
Ylk=i k~ a is the K-th harmonic number for a . 

Proof : Direct application of Lemma [1.11 with pk = □ 

These results may be used in practice to approximate the expected maximal value K 
among a sample of N values. For Lemmas II. 21 and II. 41 it is obtained iteratively by setting 
K — and increasing it until Yl!k=oP k — 1 — ~k- 

Figure [2] plots the estimates of the maximal value for samples of size N = 100 000, 
for the three types of distributions we consider, obtained from the results above, together 
with experimental values obtained by computing the average of the maximum values of 
1 000 sets of iV random values. 

In the case of power-law distributions, our approximations underestimate slightly the 
experimental values. For Poisson distributions, the evaluation fits experiments exactly, 
but for some precise values of the average only. This is due to the fact that K can only 
take integer values in the evaluation: it is actually the first integer such that ^2^=oPk > 
^jt^- Therefore, this sum may sometimes be significantly larger than jfzi, and then 
the evaluation of K is poor. To evaluate this bias, we have plotted the relative error 

(EtoPk)^™ Figure El (left). 

All in all, our approximations for the expected maximal value are quite accurate, and 
we will use them in the rest of the paper. 

An important point here is to notice that, for all the distributions we consider here, 
the expected maximal among iV sampled values grows sublinearly with N. For Poisson 
distributions this is obvious. Lemma 11.31 explicitly states it for continuous power-laws, 
and one may also check the discrete power-law case. As we will see, this is important for 
some approximations we will make in the following, and for some results in Section 13.31 

Until now, we discussed the fact that sampling TV values from a distribution induces 
an expected maximal value. But it also induces an expected distribution of the iV values, 
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Figure 2: Analytic and experimental estimates of the expected maximal among iV = 
100 000 sampled values. Left: for Poisson distributions, as a function of the average 
value; right: for discrete and continuous power-law distributions, as a function of the 
exponent. 



denoted by pk(N), which is different from the original distribution pk- We now study 
more precisely this expected distribution. 

Lemma 1.5 The expected distribution pk(N) of N values sampled from a given distribu- 
tion p k can be approximated, for all k < K , by 

N 

Pk(N) = j^—j p k 

where K is the expected maximal value, related to pk and N by Lemma li.il 

Proof : Since we sample values from a truncated distribution, we must have that Pk{N) 
is proportional to p k for all k < K: Pk{N) = C pk, and that the sum of all Pk{N) 
is 1: YlT=oP k (N) = 1- Moreover, we know from Lemma [TTTI that YIk Pk = ~b- We 
obtain 1 = Y,T=oPk( N ) = J2k=oPk{ N ) = CJ2k=oPk = C {1 - jf), where we neglected the 
difference between J2k Pk an d Y^K-iPk- The claim follows. □ 



Lemma 1.6 For a Poisson distribution with average value z, the expected distribution 
Pk{N) of a sample of N values can be approximated, for all k < K , by 

N e~ z z k 
Pk{N) = iV^T ~kT' 

where K is the expected maximal value, related to pk and N by Lemma li.ffl 

Proof : Direct application of Lemma [1.51 with pk = e~ z jr- n 
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Lemma 1.7 For a continuous power-law distribution with exponent a and minimal value 
m, the expected distribution pk{N) of a sample of N values can be approximated, for all 
m < k < K , by 

where K is the expected maximal value, related to p^ and N by Lemma \1.3[ 

Proof : Direct application of Lemma [1.51 with pk = m a ~ 1 (k~ a+1 — (k + l)~ a+1 ). □ 



Lemma 1.8 For a discrete power-law distribution with exponent a, the expected distri- 
bution Pk(N) of a sample of N values can be approximated, for all k < K , by 

N k~ a k~ a 

Pk(N) 



N ~ 1 C(«) 



K-l 



where = Yl^=i k a is the K-th harmonic number for a , and K is the expected 



maximal value, related to p^ and N by Lemma 1.4 



Proof : Direct application of Lemma [1.51 with p^ = □ 

The results above give a precise description of what one may expect from finite samples 
from Poisson and power-law distributions. They will be useful when dealing with finite 
networks below. 

First moments of a distribution 

The average (k) = YlT=o ^Pk of a distribution p^ is also called its first moment, the i-th 
moment being defined as (k l ) = YlT=o^ 1 Pk- ^ n the continuous case, the i-th moment is 
similarly defined as (k 1 ) = f^ m k l p^. In the whole paper, the first and second moments 
will play a central role. We present here the results we will need about them. Namely, 
we give formulae for the Poisson and power-law cases, both in the infinite case and in the 
case of a sample of finite size N. 

Lemma 1.9 For a Poisson distribution with average value z, the first two moments of 
the expected distribution of a sample of N values can be approximated by 

where K is the expected maximal value, related to Pk and N by Lemma 
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Proof : Direct application of Lemma 11.61 □ 

Lemma 1.10 For a Poisson distribution with average value z, the first two moments are 

(k) = z and (k 2 ) = z 2 + z. 

k 

Proof : Direct computation, with p^ = e~ z jj- □ 

Lemma 1.11 f3^ For a continuous power-law distribution with exponent a and minimal 
value m, the first two moments of the expected distribution of a sample of N values can 
be approximated by 

^ if l<a<2, 

-a+3 J ' 

if 2 < a < 3, 

—a+3 J ' 

if a > 3, 

where K is related to pk and N by Lemma li.<M 

Proof : If we approximate by 1, we obtain directly in the continuous case: (k) = 
m a -\a-l) J*xx~ a dx = ^-y a r ^~ 1 (K- a+2 -m~ a+2 ) and (k 2 ) = m a ~\a-l) J* x 2 x~ a dx = 

Moreover, when N is large, we have K ^> m and we can approximate K~ a+2 — m~ a+2 
by K~ a+2 and K~ a+3 - m~ a+3 by K~ a+3 if 1 < a < 2; K~ a+2 - m~ a+2 by -m~ a+2 
and R- a+3 - m~ a+3 by K' a+3 if 2 < a < 3; and K~ a+2 - m~ a+2 by -m~ a+2 and 
K- a+3 - m- a+3 by -m" a+3 if a > 3. The results follow. □ 



(k) 


= m a - l K- a+2 

— a+2 


and 


(k 2 ) 


= m a ~ 




(k) 


= m ^— i 

a— 2 


and 


(k 2 ) 


= m a ~ 




(k) 


= m s=± 

a— 2 


and 


(k 2 ) 


2 

= m 


a-l 
a-3 



Lemma 1.12 ]3uy For a continuous power-law distribution with exponent a and minimal 
value m, the first two moments are 

(k) = m — if a > 2 and (k 2 ) = m 2 — if a > 3, 

a — 2 a — 3 

and they diverge in all the other cases. 

Proof : Direct application of Lemma 11.111 with K tending towards infinity. □ 



Lemma 1.13 For a discrete power-law distribution with exponent a, the first two mo- 
ments of the expected distribution of a sample of N values can be approximated by 



(k) 



H 



(a-l) 
K 

H K-1 



and (k 2 ) 



H 



(a-2) 
K 



H 



(a) 
K-l 



where = Ylt=i k a is the K-th harmonic number for a , where K is the expected 



maximal value, related to pk and N by Lemma 1.4 
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Proof : Direct application of Lemma 11.81 



□ 



Lemma 1.14 Why For a discrete power-law distribution with exponent a, the first two 
moments are 

<*> = «2Lzii and (k 2 ) = «2Lz?l. 

C(a) C(a) 
Proof : Direct computation, with = ^Ar. □ 

We would like to discuss here the differences between the moments of bounded and 
unbounded distributions. Although the difference between the distributions themselves is 
small, both for the Poisson and the power-law case (the ratio between a bounded and an 
unbounded distribution is approximately N/ (N — 1)), this is not the case for the moments 
of these distributions. In practice, we can notice that for Poisson distributions, the values 
of the first and second moments are almost identical for bounded and unbounded distri- 
butions, while there is a noticeable difference for power-law distributions. This can be 
understood as follows: these differences are strongly related to the quantities Y1T=k+i ^Pk 
and YI^k+i k 2 Pk- I n both cases, values of pk ioi k > K are quite small: YlT=KPk = 1/iV. 
For Poisson networks, this 1/N is distributed among pk which decrease exponentially, and 
K is small. Therefore the values of kpk and k 2 pk for k > K are small. For power-law net- 
works on the other hand, K is large, and the probabilities pk decrease only polynomially, 
therefore values of kpk and k 2 pk for k > K are much larger. 

These observations will explain in the following why in some cases theoretical predic- 
tions for the finite case and for the infinite limit are almost identical for Poisson networks 
and quite different for power-law networks. 

We finally have all the preliminary results we need on distributions; we can now use them 
in the context of complex networks. 



1.2 Modeling issues. 

In this section we detail the models of networks we will consider, then discuss the modeling 
of failures and attacks we will use. We finally present results concerning the connectivity 
of random networks, which will play a key role in the sequel. 



Random networks 

Given an integer N and a distribution pk one can easily generate a network taken uniformly 
at random among the ones having N nodes and degree distribution pk- Indeed, it is 
sufficient to sample the degree of each of the N nodes with respect to pk, then to attach 
to each node as many stubs as its degree, and finally to construct links by choosing random 
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pairs of stubs. If the sum of degrees is odd, then one just has to sample again the degree 
of a random node until the sum becomes even. This model is known as the configuration 
model [32] and is widely used in the literature, see for instance [TBI US, E7J E] . We will call 
random networks all networks obtained using 

itH 

If one chooses a Poisson distribution of average z then one obtains an equivalent of the 
Erdos-Renyi model jH] in which the network is constructed from N initially disconnected 
nodes by adding M = links between randomly chosen pairs of nodes. One then 
obtains a network taken uniformly at random among the ones having N nodes and M 
links. 

As already discussed in the introduction, and as we will see all along this contribution, 
the degree distribution of a network may be seen as responsible for some of its most 
important features (like robustness). Studying random networks with prescribed degree 
distributions is therefore a key issue. Much work has already been done to this regard, see 
for instance 13], [TU3"1 I3"4] . These networks are particularly well suited for formal analysis, 
and most formal results obtained on complex networks in the literature, including the 
ones on robustness, rely on this modeling, this is why we use it here. For a more detailed 
discussion on network modeling and other approaches, see Section 11.51 

We will focus on three classes of networks, namely the ones with Poisson, continu- 
ous power-law and discrete power-law degree distributions, which we will call Poisson 
networks, continuous power-law networks and discrete power-law networks respectively. 

In our experiments, we will consider Poisson networks with average degree z between 
1 and 8, because for z < 1 the networks do not have a giant component (see p. [13] and 
Lemma [1.16p . and we have observed that the behaviors for z > 8 are very similar to and 
easily predictable from the ones observed for z = 8. Concerning power-law networks, we 
will always take the minimal degree m equal to 1, which fits most real- world cases. We 
will consider exponents between 2 and 4 because below 2 the average degree is infinite (see 
Lemmas 11.121 and II . 14j) and above 4 the network has only small connected components, 
as we will see below (see Lemmas 11.171 and I1.18p . Moreover, most real- world cases fit in 
these ranges. 

Let us insist finally on the fact that real-world complex networks may have other 
properties that influence their robustness, like for instance correlations between degrees, 
clustering (local density), and others. Capturing these properties in formal models how- 
ever remains a challenge. See section [T31 for more details. 

We will however discuss them informally when observing the behavior of real-world 
networks in Section HI 

4 These networks may contain loops (links from one node to itself) and multiple links (more than one 
link between two given nodes) in small quantities, which we will neglect in our reasoning as explained in 
Section 11.31 
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Failures and attacks 

There are many ways to model various kinds of failures and attacks. We will focus here 
on removals of nodes and/or links. We will suppose that failures are random, in contrast 
to attacks, which follow strategies. 

Random node failures are then series of removal of nodes chosen at random. Equiv- 
alently, one may choose a fraction of the nodes at random and then remove them all. 
Likewise, random link failures consist of series of removal of links chosen at random. 

Attacks on the other hand follow a strategy for removing nodes or/and links which has 
to be defined. We then say that we observe an attack following this strategy. For instance, 
we presented in the introduction the most famous strategy, which consists of removing 
nodes in decreasing order of their degrees. We will call this the classical attack, and we 
will define other strategies in Section [3j 

Notice moreover that, when one removes a node, one also removes all the links attached 
to it. This leads to the link point of view of node failures and attacks, which consists of 
observing the fraction of links removed during node failures or attacks. 

In the sequel we will consider all these situations: random node or link failures, attacks 
following various strategies, and link point of view of node failures and attacks. In these 
various cases we want to observe the resilience of networks, which requires to use a criterion 
to capture the impact of failures or attacks on a network. We will here consider the size of 
its largest connected component, or more precisely the fraction of nodes in this component 
as a function of the number of nodes or links removed. This captures the ability of nodes 
to communicate, which is central in our context: the smaller this fraction, the greater the 
impact of the removals. Notice however that one may use other criteria to measure the 
impact of failures or attacks, see Section 11.51 for more details. 

Largest connected component 

In many cases, the largest connected component of a random network contains most nodes 
of the network. More precisely, depending on the underlying degree distribution, the size 
of the largest connected component may scale linearly with the size of the network. The 
network is then said to have a giant component. 

There actually exists a precise and simple criterion on the degree distribution of a 
random network to predict if this network will have a giant connected component or not. 
Since most of the results we will discuss later in this contribution rely on an appropriate 
use of this criterion, we recall it here. 

Theorem 1.15 l8E[ [3J E21 WM A random network with size N tending towards infinity 
and with degree distribution p k such that it has maximal value K < iV 1 / 4 almost surely 
has a giant component if and only if: 

K 

(k 2 ) - 2(k) = ^k{k-2)p k > 0. 

fc=0 
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This theorem has been rigorously proved in [86j E] arid has been proved in the mean- 
field approximation framework in [301 96J. Detailing these proofs is out of the scope of 
this paper. 

This result can be applied to the three kinds of networks we consider here (since their 
maximal degree is sublinear, as explained in Section [TTTT pageCJ), which gives the following 
result s0. 

Lemma 1.16 A Poisson network with size tending towards infinity and average degree z 
almost surely has a giant component if and only if z > 1. 

Proof : Direct application of Theorem 11.151 and Lemma II. 10L □ 

Lemma 1.17 A continuous power-law network with size tending towards infinity, expo- 
nent a and minimal degree m = 1 almost surely has a giant component if and only if 
a < 4. 

Proof : Direct application of Theorem 11.151 and Lemma 11.121 □ 

Lemma 1.18 A discrete power-law network with size tending towards infinity and expo- 
nent a almost surely has a giant component if and only if a is such that > 2. 

Proof : Direct application of Theorem 11.151 and Lemma 11.141 □ 

One may compute the numerical value from this last lemma. One then obtains the 
condition a < 3.48 for discrete power-law networks. In summary, the criterion of Theo- 
rem [LT5] gives very simple conditions under which the random networks we consider have 
a giant component. 

1.3 Mean- field framework and generating functions. 

As already emphasized at the beginning of Section dj most results in this paper are made 
using approximations, valid in the mean-field framework. Most of these approximations 
are classical and very simple, like for instance neglecting small values when compared to 
large ones, but some are specific to random networks and deserve more attention. We 
detail them below. We then present the generating function framework, which makes it 
possible to embed these approximations in a powerful formalism. We finally recall some 
results on generating functions which will be useful in the rest of the paper. 

5 Refer to Section [L3l for the conditions under which the previous theorem is going to be applied. 
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Mean-field approximations in random networks 

The fact that stubs are linked fully at random in a random network is a feature which has 
important consequences in our context. In particular, when one removes a link chosen at 
random in such a network, this is equivalent to the removal of two stubs at random, and 
so the obtained network is still random (with a different degree distribution in general). 
Likewise, when one removes a node, the obtained network is also random. These simple 
remarks will be essential in the following. 

Mean-field approximations are very helpful in the study of random networks since they 
allow to neglect some correlations which would otherwise be very hard to handle. 

Consider for instance the neighbors of a given node, which we will call source node, 
in a large random network. Suppose that the network is sparse (the probability for two 
randomly chosen nodes to be linked together is almost 0) and that its maximal degree is 
small compared to its size, which will always be true in our context. Then the probability 
that two of these neighbors are directly linked together is negligible. Likewise, if we 
take all the nodes at distance 2 of the source node then the probability of having a link 
between two of them is very small and may also be neglected. So does the probability 
to have a link from a node at distance 2 to more than one node at distance 1, or to the 
source. Continuing this reasoning, the network may be considered locally as a tree: any 
subnetwork composed of the nodes at a distance lower than a given finite value is a tree 
if the size of the network tends towards infinity. 

The approximation above relies on the fact that we neglect very small probabilities, 
or equivalently that we consider the limit where the size of the network tends towards 
infinity. 

In the same manner, it is known that any random network with a maximal degree lower 
than a/ (k)N almost surely has no loops or multiple links (29J, [23] - Likewise, Theorem 1 1.1 51 
is formally true only for networks with maximal degree less than N l / A . For both cases the 
conditions might not be true for all the networks under concern, however we will consider 
that the networks do not possess loops and multiple links and that Theorem 11.151 can be 
applied. 

The mean-field framework allows another important approximation which comes from 
the fact that there is no distinction between choosing a stub at random and following a 
link at random from a random starting node. Indeed, since links are formed by pairs of 
randomly chosen stubs, it makes in principle no difference. 

One consequence is that we suppose that there is no correlation between the degree of 
a node and the degrees of its neighbors, i.e. that the random starting node we choose has 
no impact on the neighboring node we will reach. This is indeed true when the maximal 
degree is below iV" 111 ^ 1 / 2 ' 1 /^ -1 ^ [21], but not if the maximal degree is larger. We will 
neglect the possible correlations, which is classical in the mean-field approach, even if the 
above condition is not fulfilled. 

This approximation may be used to describe the degree distribution of neighbors of 
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nodes, in other words the degree of a node reached by starting from a randomly cho- 
sen node and following one of its links chosen at random. According to the mean-field 
approximation above, this is equivalent to choosing a random stub and therefore the prob- 
ability that a random stub belongs to a given node is proportional to this node's degree, 
i.e. the probability of reaching a node of degree k is proportional to k p^- The sum 
of these probabilities must be equal to 1, we therefore obtain the following probability: 

k p k _ k p k 

EJLoipj (k) ■ 

We can derive from this the probability qk that a neighbor of a node has k other 
neighbors, which will be useful in the sequel. It is nothing but the probability that a node 
obtained by following a link has k + 1 neighbors, and so: 

(k + l)p k+1 /s 

Qk = ■ 



(k) 



Basics on generating functions 



Generating functions, also called formal power series, are powerful formal objects widely 
used in mathematics, computer science and physics. They encode series of numbers (sk)k>o 
as functions f(x) = J2kLo s k xk - Operations on the series of numbers then correspond to 
operations on the associated functions, which often are much more powerful. See [121] 
for a general introduction. 

The application of generating functions to the random network context is presented 
in details in [99] . Using them to encode series of probabilities (like for instance degree 
distributions), the authors show how mean- field approximations may be embedded with 
benefit in this formalism. Once this is done, it is possible to manipulate the associated 
notions efficiently and easily. We give an overview of this approach below, and we refer 
to [HH] for a detailed and didactic presentation with illustrations. We follow the notations 
in this reference, and we will use them all along the paper. 

Let us begin by encoding the degree distribution pk by the following generating func- 
tion: 

oo 

G {x) = Y,VkX k . (2) 

k=0 

This function is an encoding of the whole distribution since one may obtain pk by differ- 
entiating it k times, then evaluate it at x — and divide the result by k\: Pk = G^\o)/k\. 
Moreover, we have Gq(1) = YlT=oPk = 1 (this is true for all generating functions encoding 
distributions of probabilities), and the average is given by (k) = YlkLi k Pk = G' Q {1). 

Going further, let us consider the generating function G\ for the number of other 
neighbors of a node chosen by following one random link of a randomly chosen node. 
This number is distributed according to qp., defined in Equation [TJ We then have 

G 1 {x)-^g hX ^ = ^ = -^-. (3) 
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This generating function will be useful in the sequel. For more details on how to use 
generating functions in the context of random networks, see [99J. 

We give now a few results on generating functions which will play an important role. 
These results are rewritings of results in [25j [96] . They aim at expressing the existence of 
a giant component in terms of generating functions. 

Let us consider a random network with degree distribution encoded in Gq. Let 
us suppose that some of its nodes (resp. links, i.e. pairs of stubs) are marked. All 
marked nodes are to be removed, we are therefore interested in components composed of 
unmarked nodes, i.e. sets of unmarked nodes such that there exists a path composed only 
of unmarked nodes (resp. links) between any two of them. We will call such sets of nodes 
clusters and we are interested in the existence of a giant such cluster. 

Let us consider a node reached by following a random link, i.e. a node obtained by 
picking a random stub. We will first compute the number of unmarked nodes that can 
be reached from this node by following links between unmarked nodes (resp. unmarked 
links) only. 

Two cases may occur: either the chosen node (resp. stub) is marked, in which case 
the cluster is of size 0, or it is unmarked. Let us denote by the probability that it is 
unmarked and has k other stubs, and by Fi(x) the corresponding generating function: 
° =0 r k x k . Note that the case where the chosen node (resp. stub) is marked 
plays no role in Fi(x). Note also that -Fi(l) is the fraction of unmarked nodes (resp. links) 
in the network. 

When the size of the network tends towards infinity, the clusters have a limit distribu- 
tion of sizes. We will call finite clusters the ones with a finite size in this limit distribution, 
while we call infinite clusters the others. We denote by Hi(x) the generating function for 
the distribution of the size of such finite clusters. Notice that Hi(x) does not take into 
account infinite clusters, if they exist. 

Lemma 1.19 f2Rl9^ The generating function H\{x) satisfies the following self-consistency 
condition: 

H 1 {x) = l-F 1 {l) + xF 1 {H 1 (x)). 

Proof : The cluster is of size if the chosen node (resp. stub) is marked, which happens 
with probability 1 — -Fi(l) since F\{\) is the fraction of unmarked nodes (resp. links). 

In the other case, let us denote by r& the probability that the initial node has k 
other links, i.e. Fi(x) = X]feLo rfca;fc - Since we consider networks whose size tends towards 
infinity, according to the mean-field framework we can neglect cycles (i.e. multiple paths 
between two nodes) in finite clusters. Then, the size of the cluster is 1 plus the sum of the 
sizes of the clusters at the end of these k links. The distribution for the sum of the sizes 
of k independent clusters is given by H^(x), see [99]. Moreover, the distribution of 1 plus 
a value is obtained by multiplying the corresponding generating function of this value by 
x. We therefore obtain H^x) = 1 - F x (l) + x J2™ = o r k H i ( x ) = 1 ~ ^i(l) + xF 1 (H 1 (x)). 
□ 
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Theorem 1.20 \2h\ \9b^ If r is the fraction of marked nodes (resp. links) such that 
removing all these marked nodes (resp. links) gives a network with no giant component, 
then t is such that F{(1) = 1. 

Before proving this result, we need a new approximation, made implicitly in [25| [96] . 
It consists of assuming that the average size of components in a random network is finite if 
and only if there is no giant component. This is an approximation since one can construct 
graphs such that all the components are of infinite but sub-linear size (thus there is no 
giant component), in which case the average is infinite. Conversely, there may be a giant 
component but a finite average size0. This approximation is however necessary for the 
following proof of Theorem 11.201 

Proof : Suppose we marked enough nodes (resp. links) to ensure that there is no 
giant cluster, or equivalently that there is no giant component in the network where 
marked nodes have been removed. According to the assumption above, the average size 
of components is finite (which does not mean that there is no infinite component) and 



is given by H[(l). From Lemma \1.19\ H[(x) = Fi(Hi(x)) + xF{(Hi(x))H[(x), and since 
Hi(l) = 1, we obtain 



If there is still a giant component in the network the above calculations do not hold 
since Hi(l) is no longer equal to 1. The calculations are valid only for fractions of removed 
nodes (resp. links) in the interval ]r, 1] for a given r which is the fraction of marked (thus 
removed) nodes (resp. links) above which there is no giant component anymore. 

Notice now that the expression above for H[(l) diverges at the point F[(l) = 1, which 
defines r. If we choose to remove a fraction of nodes (resp. links) closer and closer to r, 
but still larger than it, the size of remaining components grows. It keeps growing until 
the point where the fraction of removed nodes (resp. links) is not large enough to destroy 
the giant component. At this point, the average size of finite components tends towards 
infinity. □ 

The result we have just described is very powerful and general. We will see that it 
can be applied to many cases and give simple results with direct proofs: to compute 
the fraction of nodes (resp. links) to remove from a network in order to ensure that the 
resulting network contains no giant component, it is sufficient to give an expression for 
Fi(x) and then to determine the fraction which leads to F[(l) = 1. 

One must however keep in mind that they rely on mean-field approximations, and 
that the formalism sometimes makes it difficult to see exactly when approximations are 
performed. 

6 Computing the distribution of component sizes is a difficult task [861125]. 




18 



We insist on the fact that, in the current state of our knowledge, the above-mentioned 
approximations are necessary to derive the results we seek. It is important however to 
pursue the development of exact methods in order to validate these results and deepen 
our understanding. It is important, too, to know exactly the approximations we make 
and when we make them. We will carefully point out the uses of these approximations in 
the whole paper. 

1.4 Plots and thresholds. 

In all plots of this paper, Poisson, C PL and D PL stand for Poisson networks, continuous 
power-law networks, and discrete power-law networks, respectively. 

The first main kind of plots we will consider in the sequel represents the fraction of 
nodes in the largest connected component of a network as a function of the fraction of 
removed nodes or links. Figured] provides an example. To produce these plots, we sampled 
a large number of networks (typically 1 000) on which we repeated the experiment, and 
then plotted the average behavior. In order to be able to compare the various kinds 
of networks, we selected two typical exponents for the power-law, namely 2.5 and 3, 
produced continuous and discrete power-law networks with these exponents, as well as 
Poisson networks with the same average degrees. These values are summarized in Table [TJ 
The figures of this kind are Figures EJ El El [EH [[3] and [151 





average degree 


exponent 


continuous power-law 


discrete power-law 


2.5 


2.6 


1.9 


3 


1.6 


1.4 



Table 1: The exponents we consider in our experiments on power- law networks, and the 
average degrees they induce (obtained in practice with minimal value m = 1 and N = 
100 000 nodes; they are slightly lower than previsions from Lemma 11.111 for continuous 
power-laws, but fit very well the previsions from Lemma [1.131 for discrete power- laws). 

In our context, it is usual to witness a threshold phenomenon (typical of statistical 
mechanics and more precisely percolation theory, see for instance [111] ): there exists a 
critical value p c such that, whenever the fraction of removed nodes (or links, depending 
on the context) is lower than p c , the network almost surely still has a giant component, 
whereas whenever the fraction of removed nodes (or links) is greater than p c the network 
almost surely does not have a giant component anymore. In other words, the threshold is 
reached when the fraction of nodes in the largest connected component goes to zero (there 
is no giant component anymore). These thresholds play a central role in the phenomenon 
under consideration and will be often studied in the sequel. 

Notice that, for a given finite size network, the notion of threshold does not make 
sense: the fraction of nodes in the largest connected component will never be zero. In this 
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case, there are several ways to define a threshold. One may notice that, when we reach the 
threshold, the slope of the plot of the fraction of nodes in the largest connected component 
in function of the number of removed nodes goes to infinity. In finite-size computation, 
we may therefore consider that we reach the threshold when this slope is maximal [25] . 
Notice that this does not always make sense: it may happen, like in Figure [3] (right), that 
the slope is maximal at (while the expected value of the threshold is closer to 1). One 
can then adopt the convention that in such cases there is no threshold, but this reduces 
our ability to discuss practical cases. 

Another solution, described in }105j , consists of computing the degree distribution of 
the network after each removal of a node or a link, and see if it satisfies the criterion of 
Theorem 11.151 for it to have a giant component. The threshold is then the fraction of 
nodes or links to remove so that the network does not satisfy this criterion anymore. 

The solution we choose is to consider that the threshold is reached when the largest 
connected component contains less than a given (small) fraction of all the nodes. We 
chose a fraction which makes both definitions quite equivalent in our cases, namely 0.05. 
In other words, we consider that a network does not have a giant component whenever the 
size of its largest connected component is less than 5 % of the whole. Notice that changing 
this value may have a impact on numerical results. However, similar observations would 
be made. 

This leads us to the second main kind of plots encountered in this paper. For a given 
node or link removal strategy, these plots represent the threshold as a function of the 
main character of each kind of networks: the average degree for Poisson networks, and 
the exponent of the power-law for power-law networks. We plot experimental results 
obtained by averaging results on large number of networks (typically 1 000), for different 
sizes of networks (typically 1 000, 10 000 and 100 000). To help in the comparison between 
different kinds of networks, we add on these plots vertical lines at the values quoted in 
Table [H We also plot the theoretical predictions we obtain, together with experimental 
results, to make it possible to compare them. The figures of this kind are Figures HI El [8], 
□HI m [Hand us 

We will see that the experimental results do not always fit analytic predictions very 
closely. This is influenced in part by the choice to consider that a giant component must 
contain at least 5 % of the nodes, as explained above. But other factors impact this. 
In the case of random failures, for instance, there is a significant difference between the 
infinite limit and the finite case, even for large sizes. This is why we present results for 
both finite cases and the infinite limit when possible. This makes it possible to observe 
the error due to the asymptotic approximation. More generally, the difference between 
predictions and numerical values are due to the approximations made in the derivations 
of the analytic results. 

For Poisson networks, for instance, we are faced with the same problem as the one 
concerning the evaluation of the maximal degree of finite networks, see Section 11.11 and 
Figure [2J since some parameters can only take integer values, their analytic evaluation 
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may lead to values quite different from their true values. Therefore, we have chosen to use 
only the analytic values of the threshold for which the error due to this effect is minimal. 

Notice finally that the plot for a particular instance may vary significantly from the 
average behavior, in particular for power- law and/or small networks. We do not enter in 
these considerations here. 

1.5 Towards a more realistic modeling 

Modeling large networks is a complex task and many parameters have to be taken into 
account. The modeling approach we use in this paper relies on random sampling: given 
a set of properties of a real network that one wants to reproduce, the goal is to choose 
with uniform probability a graph among the set of all graphs having these properties. 
This approach has the advantage of allowing to study precisely the impact of a given 
property: if a certain behavior is observed on graphs obtained with such a model, then 
we can conclude that this behavior is a consequence of the studied property. This type 
of model is also well-suited for exact proofs. However, it also suffers from limitations: 
some properties cannot currently be reproduced by this type of models. It is for instance 
currently impossible to sample a graph uniformly at random among all graphs having a 
given clustering coefficient, or even a given number of triangles [95 1 I6T 1 160]. In summary, 
this type of model is well-adapted for understanding the impact of some properties and for 
formal proofs, but is not currently able to reproduce all properties of real-world networks. 

In this paper, we focus on the degree distribution of graphs, and use models produc- 
ing random graphs with given degree distributions. However, the sole degree distribution 
cannot reproduce the complexity of networks (for instance it is possible to produce com- 
pletely different networks having the exact same degree distribution). It has also been 
shown that when the sole degree distribution is taken into account, high degree nodes 
tend to be connected to each other, which might not be realistic [46J. 

Efforts have therefore been made to study other properties taking into account the 
tendency of nodes to be linked to nodes of the same degree or not, and incorporate them 
into random models. For instance, this can be captured by the assortativity parame- 
ter [01], degree correlations which are the probabilities P(d\d') that a node of degree d is 
linked to another node of degree d' [TT] I119j . or by the s(g) parameter which is the sum 
of didj for all links normalized by the larger value obtainable on a graph with the 

same degree sequence [HI [81] . 

Some authors have studied network resilience to failures and attacks, in a similar way 
as what we present in this paper, on random networks with degree correlations. In [119] , 
the authors study the resilience of assortative networks (nodes are linked to similar nodes) 
and disassortative networks to failures. The resilience depends on the second moment of 
the degree distribution and the correlations: assortative networks are very resilient even 
when the second moment of the degree distribution is finite, while disassortative networks 
can be fragile when the second moment is divergent. 
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Another, and orthogonal, modeling approach consists of taking into account properties 
that play a role in the construction of a network. For instance, in the case of the internet, 
one can consider the way routers work, or other technological or economic constraints. 
One then iterates an evolution process which respects a set of properties and produces in 
the end a graph similar to the original network. 

This type of approach has the advantage of being able to take into account many prop- 
erties that cannot be considered in random modeling, for instance clustering [1201 [7T| B5| 
HU [?2l [66] . This makes it relevant for producing graphs for simulation purposes. However, 
the evolution rules in these models create graph structures that are hard to characterize, 
and in the end the properties of the obtained graphs are not always fully understood. A 
simple example of this is the preferential attachment model [I], which produces trees (if 
each new node creates a single link) or graphs with no nodes of degree one (if new nodes 
create more than one link). These models therefore allow to study the impact of the 
construction rule on the observed behaviors, but do not allow yet to study the impact of 
some topological properties of the networks. Also, in the case of optimization models, it 
is not always easy to find interesting parameters to optimize. Indeed performance related 
measures are quite natural in the context of computer networks but can be harder to 
find in social networks for instance. In summary, this approach is relevant for taking into 
account complex network properties, and for simulation purposes, but does not allow to 
study precisely the impact of global topological properties. Therefore, these two types of 
approaches are complementary. 

The HOT framework, Heuristically Optimal Topology or Highly Optimized/ Organized 
Tolerance/ Tradeoffs, belongs to this approach. This framework has been mainly used in 
the context of the internet [82]. The authors of these papers propose to rewire a 
network with a given degree distribution in a non random fashion which preserves the 
degree sequence. The nonrandomness consists of rewiring with the aim of optimizing 
some properties of the networks to mimic real properties, for instance technological or 
economical ones. This kind of optimization can also be found in the context of biological 
systems [38J. 

Concerning network robustness, we only consider here the size of the largest compo- 
nent, which describes the ability of nodes to communicate, as an indicator of the state of 
the network after failures or attacks. However other approaches have been introduced. 

In [78], [67] the authors use the efficiency, also called average inverse geodesic length, 
which is computed as 1/(N(N — 1)) 1/dij, where dij is the distance between i and j. 
An high efficiency means that pairs of nodes are on average close to each other. This is 
very similar to the average distance but allows to consider disconnected networks, which is 
pertinent in our context. In a similar fashion, the authors of |lUlj introduce the Diameter- 
Inverse-K (DIK) measure defined as d/K, where d is the average distance between pairs 
of connected nodes, and K is the fraction of pairs of nodes which are connected. It allows 
to take into account disconnected graphs and for connected graphs allows to distinguish 
between short or large average distance. In [37], the authors consider that using shortest 
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paths can put a higher load on some nodes which can increase when failures or attacks 
occur. In consequence, each node is associated with a given capacity that cannot be 
exceeded without a loss of efficiency of the node, which forces the use of longer paths. 
This concept has mainly been used for cascading failures and attacks but could be used 
as a more evolved definition of efficiency. 

More specific measures have also been introduced. In particular, in the case of the 
internet, nodes and links are used to carry some demand and the efficiency of the network 
can be measured as its capacity at carrying it. In [821 H6], the authors define the maximal 
throughput based on the bandwidth of nodes, a routing matrix and the traffic demand 
for all nodes. In the case of the internet, high degree nodes are at the periphery of the 
network and the fragility of the network lies more on the low degree core nodes than on 
the high degrees periphery nodes. 

Some attack strategies based on these measures of importance have also been intro- 
duced. For instance, attack strategies that focus on important nodes rather than on large 
degree nodes. In [57] for instance, the authors compare the removal of highest degree 
nodes with the removal of highest betweenness nodes. Furthermore, the order of removal 
can be either chosen before any removal occurs, or recalculated after each removal. 

Finally, more complex types of failures and attacks, like cascading failures, have been 
considered in the literature, for instance in M, US UM ESI ESI fT08l fT24] . 

We presented here a quick overview of the different types of graph modeling that exist 
in the literature, together with a brief discussion on their advantages and drawbacks. We 
also presented other attack or failures strategies, as well as other definitions of network 
resilience. Studying all this in detail is however out of the scope of this paper. 

2 Resilience to random failures. 

The aim of this section is to study the resilience of random networks to random failures. 
Recall that random node (resp. link) failures consist of the removal of randomly chosen 
nodes (resp. links). 

We will first consider random node failures (Section 12. II) on general random networks, 
and then apply the obtained results to Poisson and power-law networks. We will see that 
the empirical observations cited in the preliminaries concerning the different behaviors of 
Poisson and power-law networks are formally confirmed. In order to deepen our under- 
standing of random node failures, we will consider in Section 12.21 these failures from the 
link point of view: what fractions of the links are removed during random node failures? 
Finally, we will consider random link failures (Section 12.31) . 
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2.1 Random node failures. 



In this section, we first present a general result on random node failures, independent of 
the type of underlying network, as long as it is a random network. We detail the two 
main proofs proposed for this result [25[ EE1 EH EI] . We then apply this general result to 
the special cases under concern: Poisson and power-law (both discrete and continuous) 
networks. Figure E] displays the behaviors observed for these three types of networks. 




Figure 3: Size of the largest connected component as a function of the fraction of randomly 
removed nodes. For technical details on our plots see Section 11.41 

As explained in the preliminaries, there is a fundamental difference between Poisson 
and power-law networks: in the Poisson case the giant component is destroyed when a 
fraction of the nodes significantly lower than 1 has been removed, whereas in the power- 
law cases one needs to remove almost all nodes. The aim of this section is to formally 
confirm this, and give both formal and intuitive explanations of this phenomenon. 

2.1.1 General results 

Our aim here is to prove the following general result, which gives the value of the threshold 
for random node failures. 

Theorem 2.1 [25\ l9E[ [221 [ffffi The threshold p c for random node failures in large random 
networks with degree distribution pk is given by 

(k) 

Pc = 1 



(V) - (k) 



Notice that this theorem states that in some cases p c might be less than 0. But we have: 
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According to Theorem I1.15[ this implies that the network almost surely has no giant 
component. In this case, the notion of threshold therefore has no meaning, and the 
theorem is irrelevant. 

Theorem 12.11 has been derived in different ways in the literature. The two main 
methods were proposed in [30j EI] and in j25j [96] . We detail both approaches below. 

Let us begin with the proof in (30j [35]. It relies on the fact that random node failures 
on a random network lead to a network which may still be considered as random (with a 
different degree distribution), as explained in the preliminaries. Therefore, by computing 
the degree distribution of this network, one can use the criterion in Theorem 11.151 to 
decide if there is still a giant component or not. 

Lemma 2.2 [30[ \3J$ In a large random network with degree distribution pk, after the 
removal of a fraction p of the nodes during random node failures the degree distribution 
Pkip) is given by 



vk{p) = jr v J h \i-p) k p 

k =k ^ ' 



kkg—k 



Proof : If a given node had degree ko before the removal, then the probability that it 
has degree k! < k after the removal is (^) (1 — <p} k ' p k °~ k ' ■ Indeed, k — k! of its neighbors 
have been removed with probability p, and k! of its neighbors have not been removed with 
probability (1 — p). □ 

In order to apply Theorem II .15[ we now have to compute the first and second moments 
of the new degree distribution: 

Proposition 2.3 j3(k\3J$ With the notations of Lemma \2.2[ the first and second moments 
of the degree distribution p^ip) are 

(k(p)) = (1 -p)(k) and (k\p)) = (l-p) 2 (k 2 ) +p{l-p)(k). 

In order to prove this proposition, we need the following technical lemma which gives the 
first and second moment of a binomial distribution. 

Lemma 2.4 For any integer k and ko, and any real p, we have 

fco 



k=0 

and 

fc 



f;fc^)(i-p)*p fcD -* = (i-p)fc b , 



E k 2 M (i - p) v°- k = (i - P )X + P (i - P )k . 

fc=0 ^ ' 
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Proof : Let us start with: 



fc=0 ^ ' 



If we differentiate this equality with respect to x and then multiply the resulting equality 
by x, we obtain: 

fco 

r .k k(j-k 



xk (x + y) k ^ = ^^kx k y" 



fc=0 

We obtain the first claim by setting x = 1 — p and y = p in this equation. 

If again we differentiate the last equation with respect to x and multiply the resulting 

equality by x, we obtain: 



fco / 1 \ 

x(k (x + y)*- 1 + sfc (*b - l)(x + y) fe( - 2 ) = J2 k \ 1°) ^W* "'' 

fc=0 ^ ' 

By setting x = 1 — p and y = p we obtain: 

Efcio fc2 (fc)( 1 -P)V H ^ = (l-p)^o + (l-p) 2 A:o(A:o-l) 

= (l-p) 2 ^+p(l-p)/c , 

which ends the proof. □ 
We can now prove Proposition 12.31 

Proof : The claims follow from the following series of equations. 

(Hp)) = J2T=o k Pk(p) 

= EZokEZ= k p^)a-p) k p k "~ k 

= EZ=oEt kp ko ( k £)(i-p) k P k °- k 
= EZ^oEtoK^i 1 - p) k p k °- k 

= T,Z=oPko( 1 -P) k O 

= a-p)(k) 

where we made an inversion of the sum between the second and third line and used 
Lemma [2.41 between lines four and five. 

= Etok 2 EZ= k p^)a-p) k p k °- k 

= E: = oE;° = o^o(a(i-riv°- fc 

= EZ=oP*oEt k 2 ( k Z)(l-p) k P k »~ k 
= EZ=oPkoi(±-p) 2 k 2 +p(l-p)k ] 

= (l-p) 2 {k 2 )+p(l-p)(k) 
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using the same tricks as for the first series of equations. 



□ 



Finally, this yields the following proof for Theorem 12.11 

Proof : The threshold p c is reached when the network does not have a giant component 
anymore. From Theorem 11.151 this happens when (k 2 (p c )) — 2(k(p c )) = 0. From Propo- 
sition E31 this is equivalent to (1 — p c )[(l — p c )(k 2 ) — (2 — p c )(k)} = 0, which gives the 
result. □ 



Let us now describe the method developed in [251 [96], to obtain Theorem 12. II It relies 
on the use of generating functions (see Section [L~3]) . each node being marked as absent 
with probability p and as present with probability 1 — p. 

Recall that F\ (x) is the generating function for the probability of finding an unmarked 
(i.e. present) node with k (marked or unmarked) other neighbors at the end of a randomly 
chosen link. In our case, Fi(x) therefore is 

oo 

F 1 (x) = -p)q k x k = (l-p)G 1 (x), 

where G\(x) = YlT=o 1 kxk * s ^ ne generating function for the probability of finding a node 
with k others neighbors at the end of a randomly chosen link, defined in Section [L~3l We 
can then prove Theorem 12.11 as a direct consequence of Theorem 11.201 
Proof : From Theorem 11.201 the threshold p c is reached when F[(l) = 1, which is 
equivalent here to (1 — p c )G[(l) = 1. Therefore p c satisfies 

1 

Pc ~ l ~G^r 

We know that Gi(x) = J2T=o Qk% k = YlkLi kpux k ~ l / (k). Therefore G' x {x) = Y^k=2 — 
l)p k x k - 2 /(k) = ^X k(k - l)p k x k - 2 /{k), and G[(l) = ^fe^. This ends the proof. □ 



The two proofs have different advantages and drawbacks. The first one is self contained 
and relies only on classical probabilistic notions, but it is quite long and technical. The 
second one is very concise and simple, but it relies on the generating function formalism, 
which has to be first introduced and understood. These differences do not only have an 
impact on the aspect of the proofs: they also imply that one has to think carefully about 
each approximation in the first approach, while they are hidden in the generating function 
formalism in the second one. As a counterpart, the first approach makes it easier to tune 
and locate approximations precisely. 
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2.1.2 The cases of Poisson and power-law networks 

Theorem 12.11 is valid for any random network, whatever its degree distribution. To study 
the behavior of Poisson and power-law networks in case of random node failures, we 
therefore only have to apply it to these cases. More precisely, we will consider Poisson, 
continuous power-law and discrete power-law networks, and, for each of these classes, 
both finite networks with N nodes and finite networks with size tending towards infinity. 
Comparison with simulations will be provided at the end of the subsection. 

Note that we will derive all the results for finite size networks as corollaries of results 
presented in previous sections. The results for networks with size tending towards infin- 
ity can then be derived either from results of the previous sections, or as limits of the 
corresponding finite cases. 

Corollary 2.5 For large Poisson networks with N nodes and average degree z, the thresh- 
old p c for random node failures is given by 

v _ 1 EL) **/(*-*)! 

E£o*V(*-2)!' 

where K is the maximal degree or the network, related to pk and N by Lemma \1 . 2\ 
Proof : Direct application of Theorem 12.11 using Lemma 11.91 □ 



Corollary 2.6 [30] For Poisson networks with size tending towards infinity and average 
degree z, the threshold p c for random node failures is 

1 

Pc = 1 • 

z 

Proof : Direct application of Theorem 12.11 using Lemma 11.101 □ 



Corollary 2.7 [30] For large continuous power-law networks with N nodes, exponent a 
and minimal degree m, the threshold p c for random node failures is 



Pc 



3— a J 

^fmiV^f - 1 

a— 3 

P±mN^ - 1 

3— a 



-1 



-1 



if a > 3 
if 2 < a < 3 

if 1 < a < 2. 



28 



Proof : We can rewrite Theorem 12.11 into p c = 1 — 1/ {(k 2 ) / (k) — 1). From the approxi- 
mations of (k) and (k 2 ) in Lemma [1.1 11 we then obtain: 

Pc= { 1 - [^lm a - 2 K 3 ~ a - 1] 1 2 < a < 3 
i Ka<2. 

Using the evaluation of K in Lemma [1.3[ we obtain the result. □ 



Corollary 2.8 [30] For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the threshold p c for random node failures is 



1- [f^m-ll 1 ifa>3 
'"' ] 1 z/l<a<3. 

Proof : Direct application of Corollary 12.71 when the size tends towards infinity. □ 



Corollary 2.9 For large discrete power-law networks with N nodes and exponent a, the 
threshold p c for random node failures is given by 



Pc 



rr(a-l) 



rr(a-2) _ rr(a-l) 
n K n K 



where H^f 1 = Y^k=i k a is the K-th harmonic number for a , where K is the maximal 



degree or the network, related to pk and N by Lemma [77^ 



Proof : Direct application of Theorem 12.11 using Lemma 11.131 □ 



Corollary 2.10 JMS$ For discrete power-law networks with size tending towards infinity 
and exponent a, the threshold p c for random node failures is 

Pc = 1 



C(a-2)-C(a-l)" 

Proof : Direct application of Corollary 12.91 when the size tends towards infinity. □ 



We plot numerical evaluations of these results in Figure HJ together with experimental 
results. We also give in Table [2] the thresholds for specific values of the exponent and the 
average degree. 
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prediction. N=1 Q0000 and infinite limit 
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Average degree 
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N = 1000 
N = 10000 
N = 100000 
prediction, N=100000 
prediction, infinite limit 



2.5 3 

Exponent 



Discrete power-law 



N = 1 000 
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\ N = 1 00000 
prediction, N=1 00000 
prediction, infinite limit 



2.5 3 3.5 

Exponent 



Figure 4: Thresholds for random node failures. For technical details on our plots, on 
the computation of thresholds, and for discussions on the origins of differences between 
experiments and predictions, see Section U~M 





continuous 


Poisson 


discrete 


Poisson 




power-law 




power-law 




a 


prev. exp. 


prev. exp. 


prev. exp. 


prev. exp. 


2.5 


1 0.74 


0.62 0.59 


1 0.67 


0.47 0.45 


3 


1 0.55 


0.38 0.34 


1 0.32 


0.29 0.26 



Table 2: Values of the threshold for random node failures on discrete and continuous 
power-law networks of exponents 2.5 and 3, and on Poisson networks having the same 
average degree (see Tabled]). The values are the analytic previsions at the infinite limit 
and the ones obtained for experiments with networks of iV = 100 000 nodes. 



The central point here is to notice that power-law and Poisson networks display a 
qualitatively different behavior in case of node failures. In theory, power-law networks 
have a threshold p c = 1 as long as the exponent is lower than 3 (most real- world cases), 
which means that all nodes have to be removed to achieve a breakdown. On the contrary, 
for Poisson networks only a finite (i.e. strictly lower than 1) fraction of the nodes has 
to be removed. This leads to the conclusion that power-law networks are significantly 
more resilient to node failures than Poisson networks, which confirms the experimental 
observations discussed in introduction. 

However, this result is moderated by the two following observations. First, Poisson 
networks may have a quite large threshold when their average degree grows (which appears 
from both analytic previsions and experiments). Second, and more importantly, power- 
law networks of finite size N are much more sensitive to failures than what is predicted 
for the infinite limit. This is already true from the analytic previsions, and even more 
pronounced for experiments. 

This is particularly clear when one compares the behavior of networks of various kinds 
but with the same average degree, see Table [2j The experimental thresholds of Poisson 
networks are at most 38% smaller than those for power-law networks of the same average 
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degrees. 

We may therefore conclude that power-law networks are indeed more resilient to ran- 
dom node failures than Poisson ones, but that the difference in practice is not as striking 
as predicted by the infinite limit approximations. 



2.2 Link point of view of random node failures. 

As discussed in the preliminaries, one may wonder what happens in networks during 
random node failures in terms of the number of links removed. The plots of the size 
of the largest component as a fraction of the number of links removed during random 
node failures are given in Figure [5] (notice that these plots are nothing but (nonlinear) 
rescalings of the plots in Figure [3]). The question we address here therefore is: how many 
links have been removed when we reach the threshold for random node failures? This is 
not equivalent to random removals of links, which are studied in the next subsection. 




Figure 5: Size of the largest connected component as a function of the fraction of removed 
links, during random node failures. For technical details on our plots see Section 11.41 

Like in the previous subsection, we will begin with general results and then apply 
them to the cases under concern. 

2.2.1 General results 

One can evaluate the number of links removed during random node failures as follows. 

Proposition 2.11 In large random networks, after the removal of a fraction p of the 
nodes during random nodes failures, the fraction of removed links is m(p) = 2p — p 2 . 

Proof : Let us consider a network in which we randomly remove a fraction p of the nodes. 
Since the nodes are chosen randomly, we can assume that the same fraction p of the stubs 
in the network were attached to the removed nodes. Each stub is kept with probability 
(1—p), the fraction of pairs of stubs linking non removed nodes is therefore (1 — p) 2 . This 
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last quantity is the fraction of non removed links and 1 — (1 — p) 2 = 2p — p 2 is finally the 
fraction of removed links. □ 



We can now use this result to study the threshold for random node failures in terms 
of the fraction of removed links. 

Corollary 2.12 The fraction of links removed at the threshold p c for random node failures 
in large random networks with degree distribution p k is 

Proof : Immediate from Theorem 12.11 and Proposition 12.111 □ 



2.2.2 The cases of Poisson and power-law networks 



We now apply the general result above to the cases of interest, which gives a corollary in 
each case. 

Corollary 2.13 For large Poisson networks with N nodes and average degree z, the frac- 
tion of links removed at the threshold p c for random node failures is 

. v -\ fEf-osV^-i)^ 2 

m{pc) = 1 1 



£L*7(*-2) 



where K is the maximal degree or the network, related to N by Lemma \1 . 2\ 
Proof : Direct application of Corollaries 12.51 and 12.121 



□ 



Corollary 2.14 For Poisson networks with size tending towards infinity and average 
degree z, the fraction of links removed at the threshold p c for random node failures is 



m(p c 



1 

i 5 ' 



Proof : Direct application of Corollaries 12.61 and 12.121 



□ 



Corollary 2.15 For large continuous power-law networks with N nodes, exponent a and 
minimal degree m, the fraction of links removed at the threshold p c for random node 
failures is 



m(p c 



1 - 
1 - 

1 - 



m 



2- a • 1 - 2 

3- Q 

a— 3 



f^miV— - 1 

3— a 



a > 3 
2 < a< 3 



1 < a < 2. 
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Proof : Direct application of Corollaries 12.71 and 12.121 



□ 



Corollary 2.16 For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the fraction of links removed at the threshold p c for 
random node failures is 



m{p c 



1 - [f^m - 1] a > 3 

1 1 < a < 3. 



Proof : Direct application of Corollaries 12.81 and 12.121 □ 



Corollary 2.17 For large discrete power-law networks with N nodes and exponent a, the 
fraction of links removed at the threshold p c for random node failures is given by 



m(p c ) = 1 - 



rr(a-l) 



r(a-2) _ u-(a-l) 
K n K 

where H^' = Ylt=i ^ s the K-th harmonic number for a , and K is the maximal 



degree or the network, related to N by Lemma 1.4 



Proof : Direct application of Corollaries 12.91 and 12.121 □ 



Corollary 2.18 For discrete power-law networks with size tending towards infinity and 
exponent a, the fraction of links removed at the threshold p c for random node failures is 



m(p c ) = 1 - 



C(o-l) 



X(a-2)-C(a-l). 

Proof : Direct application of Corollaries 12.101 and 12.121 □ 



We plot numerical evaluations of these results in Figure El together with experimental 
results. We also give in Table [3] the thresholds for specific values of the exponent and the 
average degree. 

As expected, these results are not qualitatively different from what is observed from 
the node point of view. Again, power-law networks are more resilient than Poisson ones, 
but the difference in practice is not as important as in the predictions. 

Notice also that the fraction of removed links is significantly larger at the threshold 
than the fraction of removed nodes. This is a simple consequence of the fact that removing 
a node leads to the removal of both its stubs and some of its neighbors, as explained in 
the proof of Proposition 12.111 
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Figure 6: Thresholds for the link point of view of random node failures. For technical 
details on our plots, on the computation of thresholds, and for discussions on the origins 
of differences between experiments and predictions, see Section I1.41 





continuous 


Poisson 


discrete 


Poisson 




power-law 




power-law 




a 


prev. exp. 


prev. exp. 


prev. exp. 


prev. exp. 


2.5 


1 0.93 


0.85 0.83 


1 0.89 


0.72 0.69 


3 


1 0.80 


0.61 0.57 


1 0.54 


0.49 0.44 



Table 3: Values of the threshold for the link point of view of random node failures on dis- 
crete and continuous power-law networks of exponents 2.5 and 3, and on Poisson networks 
having the same average degree (see Tabled]). The values are the analytic previsions at 
the infinite limit and the ones obtained for experiments with networks of N — 100 000 
nodes. 

2.3 Random link failures. 

Until now we observed the behavior of random network when nodes are randomly removed, 
both from the nodes and from the link points of view. One may then wonder what happens 
when we remove links rather than nodes, still at random. This may model link failures, 
just like random removal of nodes models node failures. 

Typical behaviors for each type of random networks under concern, when one randomly 
removes links, are plotted in Figure [7J Just like in the case of random node failures (see 
Figure[3]), there is a qualitative difference between Poisson and power-law networks. Going 
further, the plots are very similar to the ones for node failures. We will see that the formal 
results for both cases are indeed identical. 

Again, in this section we will first prove a general result which we apply to the three 
cases under concern. We then compare formal results to experiments, and discuss them. 
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Figure 7: Size of the largest connected component as a function of the fraction of randomly 
removed links. For technical details on our plots see Section 11.41 

2.3.1 General results 

The goal of this section is to prove that the the threshold m c for random link failures 
actually is the same as the one for random node failures (see Theorem 12. lft : 

Theorem 2.19 J2R l3l}/ The threshold m c for random link failures in large random net- 
works with degree distribution pk is 

(k) 

mc ~ (k*)-(kY 

Just as we did for Theorem 12.11 we will present the main ways to derive this result, 
as described in [231 EE] an d in [3"Tl I3"4"j . They are very similar to the ones for Theorem 12.11 
therefore we will present them in less detail. 

The first proof is based on the fact that, as explained in the preliminaries, a ran- 
dom network in which one randomly removes links may still be viewed as a random 
network, with another degree distribution. Therefore, one can use the criterion given in 
Theorem 11.151 with this new degree distribution to decide if the network still has a giant 
component. We therefore begin with the computation of the new degree distribution. 

Lemma 2.20 fgj]/ In a large random network with degree distribution pk, after the re- 
moval of a fraction m of the links during random link failures the degree distribution pk(m) 
is given by 

Pk(m) = Pk0 ( k °) (1 - m) k m k °- k . 

k =k ^ ' 

Proof : Removing randomly a fraction m of the links corresponds to removing randomly 
a fraction m of the stubs. If a given node has degree ko before the removals, then the 
probability that its degree becomes k' < ko is ( fe ?)(l — m) k 'm k °~ k ' . Indeed, ko — k' of its 
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stubs have been removed, with probability m k ° fc ', and k' of its stubs are still present, 
with probability (1 — m) k . The result follows. □ 



This leads to the first proof of Theorem 12.191 

Proof : Notice that Lemma 12.201 actually is nothing but a direct rewriting of Lemma 12.21 
on random node failures. Therefore Theorem 12.191 is derived from Lemma 12.201 and The- 
orem 11.151 in exactly the same way as Theorem 12.11 is derived from Lemma 12.21 and The- 
orem 11.151 □ 



The other method used to obtain this result [25] relies on generating functions. As 
explained in the preliminaries, each link is marked as present with probability 1 — m, and 
absent with probability m. 

Recall that Fi(x) is the generating function for the probability that, when following a 
random link, this link is unmarked (i.e. present) and leads to a node with k other (marked 
or unmarked) links emanating from it. In our case, Fi(x) therefore is 

oo 

Fi(x) = - m)q k x k = (1 - m)G 1 (x). 

k=0 

This leads us to the second proof of Theorem 12.191 

Proof : Again, the generating function Fi(x) obtained here is exactly the same as the 
one obtained in Section I2.1.1[ page [271 f° r random node failures. The proof therefore is 
the same as the proof for Theorem 12.11 page [27J □ 



2.3.2 The cases of Poisson and power-law networks 

Since theoretical results for random link failures are the same as those for random node 
failures, we focus in this section on experimental results. See Figure M and Table HI 
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power-law 




power-law 




a 


prev. exp. 


prev. exp. 


prev. exp. 


prev. exp. 
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1 0.90 


0.62 0.60 


1 0.84 


0.47 0.46 


3 


1 0.67 


0.38 0.35 


1 0.41 


0.29 0.27 



Table 4: Values of the threshold for random link failures on discrete and continuous 
power-law networks of exponents 2.5 and 3, and on Poisson networks having the same 
average degree (see Tabled]). The values are the analytic previsions at the infinite limit 
and the ones obtained for experiments with networks of N = 100 000 nodes. 
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Figure 8: Thresholds for random link failures. For technical details on our plots, on 
the computation of thresholds, and for discussions on the origins of differences between 
experiments and predictions, see Section [T~4l 



In principle, these plots and values should be exactly the same as the ones in Figure 0] 
and in Table El This is true for the analytic previsions, but experiments differ significantly, 
which deserves more discussion. 

When we consider the size of the largest connected component as a function of the 
fraction of removed nodes/links, see Figures [3] and then it appears clearly that, though 
the plot seems to reach at the same fraction, they do not have the same shape. Since 
we chose to define the threshold as the value for which the largest connected component 
reaches 5 % of the total number of nodes, the different shapes give different experimental 
thresholds. 

As explained in Section IX .41 the 5 % value is somewhat arbitrary, but we insist on the 
fact that, in the case of power-law networks considered in these experiments, the other 
main method for computing the threshold cannot be applied: in several cases, the slope 
of the plots for these networks is always decreasing (see Section WM . 

Finally, the same conclusions as the ones for random node failures hold: power-law 
networks are more resilient to random link failures than Poisson ones, but the difference 
in practice is not as striking as predicted by the results for the infinite limit. 

2.4 Conclusion on random failures. 

Two main formal conclusions have been reached in this section concerning the case where 
the size of the network tends towards infinity. First, as expected from the empirical 
results discussed in the introduction, Poisson and power-law networks behave qualitatively 
differently in case of (node or link) random failures: whereas Poisson networks display a 
clear threshold, in power-law ones all the nodes or links have to be removed to achieve 
a breakdown. Second, what happens in case of random link failures is very similar, if 
not identical, to what happens in case of random node failures. On the other hand, link 
point of view does not change the observations qualitatively but the fraction of removed 
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links at the threshold is significantly larger than the fraction of removed nodes. This also 
means that the thresholds for the link point of view of random node failures are larger 
than the thresholds for random link failures. 

The qualitative difference between Poisson and power-law networks leads to the con- 
clusion that power-law networks are much more resilient to random failures. This may be 
used in the design of large scale networks, and it may also be seen as an explanation of the 
fact that real-world networks like the internet or biological networks seem very resilient 
to random errors. This has been widely argued in the literature, see for instance [121 [9] . 

These results however concern only the limit case where the size of the network tends 
towards infinity. When one considers networks of a given size N, even for very large 
values of N, then the difference between Poisson and power-law networks often is much 
less striking than predicted. This is even clearer when one considers the link point of 
view. 

3 Resilience to attacks. 

The aim of this section is to study the resilience of random networks to targeted attacks. 
In our context, an attack on a network consists of a series of node or link removals, like 
failures. The difference lies in the fact that the removals are not random anymore; instead, 
the nodes or links to remove are chosen according to a strategy. 

Obviously, one may define many different strategies, and failures themselves could 
be considered as attacks where the strategy consists of choosing randomly. More subtle 
strategies can however be much more efficient to destroy a network. We already presented 
such a strategy, defined in the initial paper on the topic [7], which received since then 
much attention. It consists of the removal of nodes in decreasing order of their degree; 
we will call this strategy a classical attack. 

We will first consider these classical attacks (Section 13. II) on general random networks, 
and then apply the obtained results to Poisson and power-law networks. In order to deepen 
our understanding of classical attacks, we will consider these attacks from the link point 
of view (Section 13.21) : what fractions of the links are removed during classical attacks? 
We will also introduce new attack strategies (both on nodes and on links) to give deeper 
understanding on classical attacks (Section 13. 3j) . Finally we will conclude this section 
with a detailed discussion on the efficiency of classical attacks, as well as other attack 
strategies (Section 13.41) . 

3.1 Classical attacks. 

In this section, we first present a general result on classical attacks, independent of the 
type of underlying network, as long as it is a random network. We detail the two main 
proofs proposed for this result in [25J [96] and [3U EH] . We then apply this general result 
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to the special cases under concern: Poisson and power-law (both discrete and continuous 
versions) networks. Figure M displays the behaviors observed for these three types of 
networks. 
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Figure 9: Size of the largest connected component as a function of the fraction of nodes 
removed during classical attacks. For technical details on our plots see Section II .41 

As explained in the preliminaries, there is no fundamental difference in the behaviors 
of Poisson and power-law networks in case of classical attacks: in both cases the largest 
connected component is quickly destroyed. It is important however to notice that Poisson 
networks are significantly more resilient than power-law ones. The aim of this section is 
to formally confirm these observations, and give both formal and intuitive explanations. 

3.1.1 General results 

Our aim here is to prove the following general result, which gives the value of the threshold 
for classical attacks. 

Theorem 3.1 Wh\ [31^ The threshold p c for classical attacks in random networks, with 
size tending towards infinity and degree distribution p k , is given by 

(k) 

where K(p c ) is the maximal degree in the network after the attack, related to p c by 
Lemma \3.S\ 

As in the case of failures, there are two main ways to derive this result, proposed in 
[251 196] and [2U H3]. They both rely on the following result which concerns the maximal 
degree of random networks after removal of a fraction p of the nodes during a classical 
attack, which we denote by K(p). 
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Lemma 3.2 [31^ In a random network with size tending towards infinity and degree dis- 
tribution pk, after removal of a fraction p of the nodes during a classical attack the maximal 
degree K{p) is given by 

K(p) 
k=0 

Proof : Before the removals, the network has a maximal degree K. The new maximal 
degree K{p) can then be evaluated using Yl?k=K{p)+iPk = P- From Lemma this 
is equivalent to Y^Lkm Pk = P + jf (neglecting the difference between J2T=k Pk an d 
Y^kLx+iPk) • Since N tends to infinity, we can neglect l/N, which gives p = Y^k=K(p)+iPk-> 
hence the result. □ 



In order to compute the threshold for random networks with a given degree distribution 
and size tending towards infinity, one therefore has to proceed in two steps: first compute 
the value of K(p c ) using Theorem 13.11 then obtain the value of p c using Lemma 13.21 
Note that we will mainly use Lemma 13.21 to compute a fraction of removed nodes given a 
maximal degree and not the converse. 

Before entering in the core of the proofs, notice that the above results hold for random 
networks with size tending towards infinity. It would be possible to write equivalent results 
for large networks of finite size N, by taking into account the (original) maximal degree 
given by Lemma [1.1 1 However, we do not consider this case in this section for two main 
reasons. First, and most important, the results for large networks of finite size N would 
be very similar to the ones for the infinite limit. Indeed, the equivalent of Lemma [3.21 for 
finite networks is: 

Kip) 
k=0 

(where we neglected the difference between J^K+iPk an d YlxPk)- For large networks, 
l/N is very small compared to p when p is the threshold for classical attacks (which 
we will prove later in this section). Therefore the maximal degree of a large network 
of finite size N after a classical attack is very close to the one at the infinite limit. 
Second, considering large networks of finite size N would make the following computations 
much more intricate. One must however keep in mind that the case of finite networks is 
tractable, and results can be derived in a very similar way. 

We now give the two main proofs available for Theorem 13. 11 The first one, from 
[3"Tj 131] . uses the following preliminary result. 

Lemma 3.3 f31\/ In a random network with size tending towards infinity and degree dis- 
tribution pk, when a fraction p of the nodes is removed during a classical attack, the 
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fraction s(p) of stubs attached to removed nodes is given by 

l K{p) 

s (p) = 1 - 

where K(p) is the maximal degree of the network after the attack, related top by Lemma WM 

Proof : Each node of degree k has k stubs attached to it. Therefore the fraction of 
stubs attached to all nodes of degree k is equal to kpk/{k). Therefore, the total number 
of stubs attached to removed nodes is s(p) = YuK(p)+i k Pk- At the infinite limit, this 
is equivalent to s(p) = 4r Y^km+i ^Pk, hence the result. □ 



We can now give the first proof of Theorem 13.11 

Proof : The central point here is to understand that the network obtained after the 
removal of a fraction p of the nodes during a classical attack is equivalent to a random 
network on which random link failures occurred. 

Indeed, a classical attack has two kinds of effects: it reduces the maximal degree in 
the network by removing the nodes with highest degree, and it removes the links attached 
to these nodes. If we consider links as pairs of randomly chosen stubs, as explained in the 
preliminaries, then a classical attack removes all the stubs attached to the removed nodes 
and some other stubs, which were linked to removed stubs. Since pairs of stubs are linked 
randomly, this is equivalent to randomly removing the correct number of stubs from the 
subnetwork composed of the nodes which are not removed. This is again equivalent to 
randomly removing half as many links in this subnetwork. 

If the classical attack removes a fraction p of the nodes, the fraction of stubs attached 
to removed nodes is s(p), given by Lemma [3 .31 The probability for any given stub of a 
remaining node to be linked to a stub of a removed node is therefore s(p). Finally, each 
stub attached to a remaining node is removed with probability s(p), which is equivalent 
to the removal of the same fraction of links. 

Since links in this subnetwork are constructed by choosing random pairs of stubs, it 
is also a random network. Moreover, its degree distribution is nothing but the original one 



0<k<K(p) 



with a cutoff (which is the maximal degree after the attack): yp' k = ^Jcf P ) — — 

where K(p) is the maximal degree after the attack. 

We finally obtain that a classical attack is equivalent to random link failures on a 
random network with known degree distribution. The value of the threshold can therefore 
be derived from Theorem 12.191 on random link failures. We then have to relate this 
threshold, which is the number of stubs removed among remaining nodes, to the number 
of nodes removed during classical attacks. 
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We can now apply Theorem 12.191 to compute the fraction s(p c ) of links to remove 
randomly to destroy the random network described above. This gives 

i / \ (k(Pc)) 

1 - s{p c " 



- {k( Pc )y 



where (k(p c )) and (k 2 (p c )) are the first and second moment of the degree distribu- 
tion ( p' h = -r^- ) , which is the degree distribution described above. The first 

V k l -PcJo<k<K( P S 6 

two moments of this distribution are (k(p c )} = Efc=o^ ^Pk/O- — Pc) and (k 2 (p c )} = 

ES c, feW(i-Pc). 

We can finally transform the above relation into the claim: 



1 - s(p c 



<MPc)> 



(k 2 (p c )} - (fc(p c )> 



1 K J^ ] ^ K(Pc) k^ 



<*> to ES c) Hk - i) Pk 

K(pc) 

(k) = J2 k ( k ~ l ^ k - 



k=0 



□ 



The other method used to obtain this result [25J ES] relies on generating functions. As 
explained in the preliminaries, the fraction p of nodes of highest degrees are marked as 
absent, and the others are marked as present. 

Recall that F\(x) is the generating function for the probability of finding an unmarked 
(i.e. present) node with k other (marked or unmarked) neighbors at the end of a randomly 
chosen link. In our case, F\(x) therefore is: 

1 k (p) 

Fx(x) = 1 7TY J kp k x k - 1 . 

k=i 

We can then prove Theorem 13.11 as a direct consequence of Theorem 11.201 

Proof : From Theorem II .20[ the threshold p c is reached when F{(1) = 1. Differentiating 

F x gives the result: F[{x) = ^ ^f=? k(k - l)p k x k - 2 and F{(1) = ^ £g k(k - l)p k . 
□ 
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Again, the two proofs of Theorem 13.11 have different advantages and drawbacks. See 
the comments at the end of Section 12.1.11 

3.1.2 The cases of Poisson and power-law networks 

Theorem 13.11 is valid for any random network, whatever its degree distribution. To study 
the behavior of Poisson and power-law networks in case of classical attacks, we therefore 
only have to apply it to these cases. More precisely, we will consider Poisson, discrete 
power-law and continuous power-law networks, with size tending towards infinity. Com- 
parison with simulations will be provided at the end of the section, see Figure [10J 

Corollary 3.4 For Poisson networks with size tending towards infinity and average de- 
gree z, the threshold p c for classical attacks is given by 

where K(p c ) is the maximal degree of the network after the attack, related to p c by 
Lemma [3~M 

Proof : Direct application of Theorem 13.11 with pk = e~ z z h /k\. □ 

Corollary 3.5 \25j For discrete power-law networks with size tending towards infinity 
and exponent a, the threshold p c for classical attacks is given by 

<S -J#£3 = «<«-i), 

where = J^fcLi k~ a ^ s the K-th harmonic number for a , and K(p c ) is the maximal 
degree of the network after the attack, related to p c by Lemma \3.2l 

Proof : Direct application of Theorem l3.lt with p^ = k~ a /((a). □ 

Corollary 3.6 137] / For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the threshold p c for classical attacks is given by 

WT_ 2 = a-a m f (£wy- _ \ 

\ m J 3 — a \\ m ) J 

where K(p c ) is the maximal degree of the network after the attack, related to p c by 
Lemma \3.2[ 
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Pk- 



Proof : From Theorem l3.ll we have: {^2k=m k{k — ^)Pk) I (k) = 1, thus (k) = Y^=m k 

J2k=m Kr k- Wehavep fc = m a_1 (Ar a+1 - (Jfe + and (k) = m(ot-l)/(a-2), from 

Lemma II. 12L From this, switching back to the continuous form, we obtain Ylt^^m kpk = 
(a - l)m a ~ l j* iPc) kk a ~ l dk = ^m"- 1 (K(p c 



-a+2 



ond moment, Y^=m k 2 Pk = " a l 3 m a 1 (K(p c ) a+3 



a+2 



— m 



k=m 

m~ a+2 ) and similarly for the sec- 

-a+3\ 



The above equality then becomes: 



(a — l)m 



a-l 



m 



-a+3 



K(p c )~ a+2 - m- a+2 



—a + 3 —a + 2 

We can finally transform this equation to obtain the result: 

'K(p c 



m- 



m a+3 


(K(p c ) 


-a + 3 


\ m 


m 


~ (K{Pc) 


—a + 3 


\ m 


m 


~(K(p c ) 


—a + 3 


\ m 



-a+3 



m 



-a+2 



-a+3 



-a+3 



-a + 2 

1 

-a + 2 
1 

-a + 2 



rn 

K{p c ) 
m 

K(Pc) 
rn 



a+2 



- 1 



-a+2 



- 1 



-a+2 



a — 1 
t . 

a - 2 



rn 



-a+2 



a - 2 
1 



a 



□ 



Numerical evaluations of these results can be done by computing the maximal degree 
K(p c ) with the appropriate corollary, and then injecting it into Lemma 13.21 to evaluate 
p c . For Poisson and discrete power-law networks (Corollaries 13.41 and 13. 51) . the equations 
can be solved in a similar way as computing the maximal degree of a random network, 
see Section [1~T1 For continuous power- law networks (Corollary 13. 6p . the equation can be 
solved using a computer algebra system [85]. Notice that this equation is not defined for 
a = 3; one may obtain the threshold for this value as the limit of its values when a tends 
to it. 

Moreover, still in the case of continuous power-law networks, one may use the following 
result, which is simpler and more precise (since it allows non-integer values for the degree), 
instead of Lemma 13.21 



Lemma 3.7 l31f For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the maximal degree K(p) after the removal of a fraction 
p of the nodes during classical attacks is related to p by 

p = m a ' 1 K(p)- a+1 . 
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Proof : From Lemma [3.21 we have p = Y^KM+iPa-dj, which we can approximate to be 
equal to Y^K(v)Pk- Switching back to the continuous definition, we have that p = (a — 



l)m a 1 Jx( p ) x a dx = (a — l)m a 1 
hence the result. 



-ct+l 



J K 



a-l)m a - 1 K- a!+ 7(Q;-l) = rrf^K 



a-l 



□ 



We plot numerical evaluations of these results in Figure [101 together with experimental 
results. We also give in Table [5] the thresholds for specific values of the exponent and the 
average degree. 
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Figure 10: Thresholds for classical attacks. For technical details on our plots, on 
the computation of thresholds, and for discussions on the origins of differences between 
experiments and predictions, see Section [L4"l 
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Table 5: Values of the threshold for classical attacks on discrete and continuous power-law 
networks of exponents 2.5 and 3, and on Poisson networks having the same average degree 
(see Tabled]). The values are the analytic previsions at the infinite limit and the ones 
obtained for experiments with networks of N = 100 000 nodes. 

It appears clearly that both types of networks are very sensitive to classical attacks: 
only a few percents of the nodes have to be removed to destroy them. Moreover, the 
thresholds for power-law networks are much lower than the ones for Poisson networks: 
they are almost one order of magnitude smaller than for comparable Poisson networks. 
These are certainly the main results on the topic and we will deepen them in the rest of 
the section. 
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3.2 Link point of view of classical attacks. 

The classical attack strategy removes highest degree nodes first. Since in a power-law 
network there are some very high degree nodes, this leads in this case to the removal of a 
huge number of links. One may then wonder if its efficiency on power-law networks is due 
to the fact that the number of removed links is much larger than in the case of random 
failures. Likewise, one may wonder if the fact that a classical attack removes much more 
links in a power-law network than in a Poisson one is the cause of its difference of efficiency 
on these two types of networks. These explanations actually have been proposed by some 
authors as an intuitive explanation of the results presented above. 

Figure [11] displays the behaviors observed for these three types of networks. One can 
see there that the thresholds for Poisson and power-law networks are much closer than 
from the node point of view, see Figure [9j One may also observe that, though there are 
significant differences, when one removes from power-law networks as many links as what 
is needed to destroy a Poisson network with the same average degree, then the size of the 
largest connected component becomes very small. 

The questions we address here therefore are: how many links have been removed when 
we reach the threshold for classical attacks? Is this number similar for power-law and 
Poisson networks? And is it similar to the threshold for link failures? We will investigate 
these questions more precisely in this section by first introducing a general result and 
then apply it to the cases under concern. 
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Figure 11: Size of the largest connected component as a function of the fraction of links 
removed during classical attacks. For technical details on our plots see Section 11.41 



3.2.1 General results 

Our aim here is to prove the following general result, which gives the fraction m(p) of 
links removed when a fraction p of the nodes are removed during classical attacks. 
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Theorem 3.8 In a large random network, the fraction m(p) of links removed when a 
fraction p of the nodes have been removed during a classical attack is 

m(p) = 2s(p) — s(p) 2 , 

where s(p) is the fraction of stubs attached to removed nodes, and is related to p by 
Lemma\33. 

Proof : Let us consider a network in which we remove a fraction p of the nodes during 
a classical attack. Let s(p) be the fraction of the stubs in the network attached to the 
removed nodes {s(p) is linked to p by Lemma l3.3p . Each stub attached to a remaining 
node is therefore kept with probability (1 — s(p)), the fraction of pairs of stubs linking non 
removed nodes is therefore (1 — s(p)) 2 . This last quantity is the fraction of non removed 
links, and m(p) = 1 — (1 — s(p)) 2 = 2s(p) — s(p) 2 is finally the fraction of removed links, 
hence the result. □ 



3.2.2 The cases of Poisson and power-law networks 

Theorem 13.81 is valid for any random network, whatever its degree distribution. It makes 
it possible to compute the fraction of links removed at the threshold for classical attacks. 
To study the behavior of Poisson and power-law networks, we therefore only have to 
apply it to these cases. More precisely, we will consider Poisson, continuous power-law 
and discrete-power-law networks. 

In each case, one first has to compute the (node) threshold p c for classical attacks 
using the appropriate corollary in Section [3. 1.21 then use Lemma 13731 to obtain s(p c ), 
before applying Theorem 13.81 

In the case of continuous power-law networks, it is possible to obtain s(p c ) from p c 
more easily as follows. 



Lemma 3.9 131]/ For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, when a fraction p of the nodes is removed during a 
classical attack, the fraction s(p) of stubs attached to removed nodes is given by 

s (p)=p( 2 -)/( 1 -). 



Proof : From Lemma 13731 we know that s(p) = (J2T=K(p)+i kPk)/{k), which we can 

<k=K(p) 



approximate to be equal to (J2T=K( P )kPk)/(k)- Moreover, we have (k) = from 



Lemma 11.121 

Switching back to the continuous case, we therefore have 



s(p) = —(a - \)m a - 1 / kk~ a dk = —(a - l)m a - 1 
(k) Jk(p) \k) 



k 



-Ot+2 



-a + 2 



= m a - 2 K(p)- a+2 . 

K(p) 
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From Lemma [3.71 we finally obtain K(p) = mp 1 ^ hence the result. 



□ 



We plot numerical evaluations of these results in Figure [121 together with experimental 
results. We also give in Table [6] the thresholds for specific values of the exponent and the 
average degree. 
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Figure 12: Thresholds for the link point of view of classical attacks. For technical 
details on our plots, on the computation of thresholds, and for discussions on the origins 
of differences between experiments and predictions, see Section 11.41 
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Table 6: Values of the threshold for the link point of view of classical attacks on discrete 
and continuous power-law networks of exponents 2.5 and 3, and on Poisson networks 
having the same average degree (see Tabled]). The values are the analytic previsions at 
the infinite limit and the ones obtained for experiments with networks of N — 100 000 
nodes. 

The results are striking: the thresholds are much larger from the link point of view 
than from the node point of view (see Table [5] for comparison). More importantly, while 
the number of nodes to be removed is much lower for power-law networks than for Poisson 
ones, the corresponding number of links if similar for both kinds of networks: the links 
thresholds are similar. 

The conclusion from these observations is that the fact that power-law networks are 
rapidly destroyed during classical attacks may be viewed as a consequence of the fact 
that many links are removed. It is however important to notice that the obtained behav- 
ior for power-law networks is not the same as the one obtained if we remove the same 
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amount of links at random (see Figure[8]and Table|4]for comparison). Therefore, although 
the amount of removed links is huge and this plays a role in the behavior of power-law 
networks, this is not sufficient to explain the observed behavior. This means that the 
links attached to highest degree nodes play a more important role regarding the network 
connectivity than random links. 

3.3 New attack strategies. 

In this section we introduce two very simple new attack strategies, one targeting nodes 
(Section 13.3.11) and the other targeting links (Section l3.3.2j) . These strategies are close 
to random failures. Our aim is not to provide efficient attack strategies, but rather to 
deepen our understanding of previous results. 

These two new attack strategies rely on the following observation. We have seen 
(Theorem II . 151) that a random network with size tending towards infinity has a giant 
component if (k 2 ) — 2(k) > 0. This is equivalent to the condition pi < Y1T=3^(^ ~~ ^)Vk 
(where pk is the fraction of nodes of degree k). The fraction of nodes of degree 1 in the 
network therefore plays a key role. The two attack strategies are based on the idea that 
increasing this fraction should quickly break the network. 

Since our aim here is not to compute the exact value of the threshold, but rather to 
understand a general behavior, we will only consider in the sequel the case of networks 
with size tending towards infinity. 

3.3.1 Almost-random node attacks 

The first attack strategy simply consists of randomly removing nodes of degree at least 
2. We call it the almost-random node attack strategy. 

We first present a general result for this strategy, then apply it to the special cases 
under concern: Poisson and power-law (both discrete and continuous) networks. Figure [TBI 
displays the behaviors observed for these three types of networks. 

Although this strategy is barely different from random node failures, it is actually 
much more efficient than failures (see Figure [3] for comparison) . In particular, it has a 
finite threshold for all the types of networks we consider. 

Theorem 3.10 The threshold p c for almost-random node attacks for large random net- 
works with degree distribution p\~, is bounded by 

p c <l-pt- p . 

Proof : When all nodes that had initially a degree higher that one have been removed, 
then the network surely has no giant component anymore since all remaining nodes have 
degree at most 1. All nodes of degree higher than one represent a fraction 1 — pi — po of 
all nodes; this is therefore an upper bound for the threshold for this attack strategy. □ 
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Figure 13: Size of the largest connected component as a function of the fraction of nodes 
removed during almost-random node attacks. For technical details on our plots see 
Section PI 

Theorem 13.101 is valid for any random network, whatever its degree distribution. To 
study the behavior of Poisson and power-law networks in case of classical attacks, we 
therefore only have to apply it to these cases. More precisely, we will consider Poisson 
and power-law networks (both discrete and continuous), with size tending towards infinity. 
Comparison with simulations will be provided at the end of the section, see Figure HH 

Corollary 3.11 For Poisson networks with size tending towards infinity and average 
degree z, the threshold p c for almost-random node attacks is bounded by 

p c < 1 -e~ z (z + l). 

Proof : Direct application of Theorem 13.101 with pk = e~ z z k jk\. □ 



Corollary 3.12 For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the threshold p c for almost-random node attacks is 
bounded by 

p c < l-m a -\l-2- a+1 ). 
Proof : Direct application of Theorem 13.101 with p^ = m a ^ 1 (k' a+1 — (k + l) _a+1 ). □ 



Corollary 3.13 For discrete power-law networks with size tending towards infinity and 
exponent a, the threshold p c for almost-random node attacks is bounded by 

p c <l- 1/C(a). 



50 



Continuous power-law 



Discrete power-law 



N=1000 
N=10000 
N=100000 
theoretical upper bound, infinite limit 



3 4 5 
Average degree 



N = 1 000 
N = 1 0000 
N = 100000 
theoretical upper bound, infinite limit - 



0.4 
0.35 

0.3 
0.25 

0.2 
0.15 

0.1 
0.05 




N = 1000 
N = 10000 
N = 100000 
theoretical upper bound, infinite limit - 



Exponent 



3 3.5 
Exponent 



Figure 14: Thresholds and upper bounds for almost-random node attacks. For technical 
details on our plots, on the computation of thresholds, and for discussions on the origins 
of differences between experiments and predictions, see Section 11.41 
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power-law 




a 


bound exp. 


bound exp. 


bound exp. 


bound exp. 


2.5 


0.35 0.29 


0.73 0.43 


0.25 0.2 


0.57 0.25 


3 


0.25 0.16 


0.48 0.16 


0.17 0.06 


0.41 0.10 



Table 7: Values of the threshold for almost-random node attacks on discrete and con- 
tinuous power-law networks of exponents 2.5 and 3, and on Poisson networks having the 
same average degree (see Tabled]). The values are the ones obtained for experiments with 
networks of iV = 100 000 nodes, and the theoretical upper bounds. 

Proof : Direct application of Theorem 13.101 with = k~ a /((a). □ 

We plot experimental results for the value of the threshold in Figure [TH as well as the 
upper bounds given above. We also give in Table [7| the thresholds for specific values of 
the exponent and the average degree. 

We recall that our aim here is not to obtain an efficient attack strategy, but to study 
the ability of a strategy very similar to random failures to have the same qualitative 
behavior as classical attacks, namely to display a finite threshold for power-law networks. 

To this regard, the values of the thresholds displayed in Table [7] are quite large (one 
has to remove a large fraction of the nodes do destroy the networks), but remain signif- 
icantly lower than 1 and much lower than the thresholds for node failures (see Table [2]). 
This shows that the efficiency of classical attacks relies in part on simple properties like 
removing nodes of degree larger than 1. 
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3.3.2 Almost-random link attacks 



The other attack strategy consists of randomly removing links between nodes of degree 
at least 2, i.e. a node of degree 1 will always stay connected during the attack. We call it 
the almost-random link attack strategy. 

We first present a general result for this strategy, then apply it to the special cases 
under concern: Poisson and power-law (both discrete and continuous) networks. Figure [T5l 
displays the behaviors observed for these three types of networks. 
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Figure 15: Size of the largest connected component as a function of the fraction of 
links removed during almost-random link attacks. For technical details on our plots see 
Section PI 

Although this strategy is barely different from random link failures, it is actually much 
more efficient than failures (see Figure [7] for comparison). In particular, it has a finite 
threshold for all the types of networks we consider, which makes it much more efficient 
on power-law networks. 

Theorem 3.14 The threshold m c for the almost-random link attack strategy for large 
random networks with maximal degree sublinear in the number of nodes and degree distri- 
bution pk is bounded by 

2pi p\ 

mc<1 ~Jk) + W 

Proof : The threshold is bounded by the fraction of links between two nodes of degree at 
least 2. Indeed, when all such links have been removed, the only links left are between a 
node of degree 1 and another node. Therefore the network is nothing but a set of disjoint 
stars (each central node being connected to nodes of degree 1). The size of the largest 
component therefore is less than K + 1, where K is the maximal degree in the original 
network. Hence, if K is sublinear with respect to the number of nodes, so is the size of 
the largest component after the attack. 

Let us now evaluate the number of links between nodes of degree at least 2. This 
quantity is \E\ minus the number of links incident to at least one node of degree 1. The 
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number of such links is given by the number of nodes of degree 1, minus the number of 
links between two nodes of degree 1. 

The number of links between two nodes of degree 1 can be evaluated as follows. There 
are Npi nodes of degree 1, each of them having a probability Np%/2\E\ of being connected 
to another node of degree 10. Therefore the number of nodes of degree 1 adjacent to 
another node of degree 1 is Npi ■ Np\/2\E\ = Np\/(k). Finally, the number of links 
between two such nodes is Np\/2(k). 

From this we have that the number of links adjacent to at least one node of degree 
1 is: Np\ — Np1/2(k), and the number of links not adjacent to any node of degree 1 is: 

2 

\E\ — Npi + Np\/2{k). The fraction of such links therefore is 1 — + jb, hence the 
result. □ 



Corollary 3.15 For Poisson networks with size tending towards infinity and average 
degree z, the threshold m c for almost-random link attacks is bounded by 



m c <l- 2e~ z + e~ 2z . 



Proof : Direct application of Theorem 13.141 with Pk = e z z h /k\, the maximal degree of 
the network being sublinear in the size of the network (see Lemma \1.2\i . □ 



Corollary 3.16 For continuous power-law networks with size tending towards infinity, 
exponent a and minimal degree m, the threshold m c for almost-random link attacks is 
bounded by 

2(a - 2)m a - 2 (l - 2~ a+1 ) ((a- 2)m Q - 2 (l - 2- a+1 " 



(a-l) V. (a-1) 

Proof : Direct application of Theorem 13.141 with pk = m a ~ l (k~ a+1 — (k + l)~ a+1 ), 
(k) = m (Lemma 11.12}) . the maximal degree of the network being sublinear in the 
size of the network (Lemma II. 3ft . □ 



Corollary 3.17 For discrete power-law networks with size tending towards infinity and 
exponent a, the threshold m c for almost-random link attacks is bounded by 

2C(a-l)-l 
mc < 1 7Ti V\ — • 



7 This is an approximation of the real value (N — l)pi/(2\E\ — 1) 
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Proof: Direct application of Theorem l3.14l withpt. = k Q /C(a), (k) = (Lemma ll.141) . 
the maximal degree of the network being sublinear in the size of the network (Lemma ll.4l) . 
□ 



We plot experimental results for the value of the threshold in Figure [16], as well as the 
upper bounds given above. We also give in Table El the thresholds for specific values of 
the exponent and the average degree. 
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Figure 16: Thresholds and upper bounds for almost-random link attacks. For technical 
details on our plots, on the computation of thresholds, and for discussions on the origins 
of differences between experiments and predictions, see Section 11.41 
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bound exp. 


bound exp. 


bound exp. 


bound exp. 


2.5 


0.62 0.55 


0.86 0.51 


0.37 0.35 


0.72 0.32 


3 


0.39 0.22 


0.64 0.23 


0.15 0.07 


0.57 0.15 



Table 8: Values of the threshold for almost-random link attacks on discrete and continuous 
power-law networks of exponents 2.5 and 3, and on Poisson networks having the same 
average degree (see Table [T]). The values are the ones obtained for experiments with 
networks of N = 100 000 nodes, and the theoretical upper bounds. 

As in the case of almost-random node attacks, the values of the thresholds are quite 
large but remain significantly lower than 1. Since our aim is still to study the ability 
of a strategy very similar to random failures to display a finite threshold, this result is 
satisfactory. This shows that the efficiency of classical attacks relies in part on simple 
properties like removing links between nodes of degree at least 2. 

Going further, one may notice that almost-random link attacks perform better than 
classical attacks in terms of the number of removed links (see Table [6]). This shows that 
classical attacks, although they focus on high degree nodes, actually remove many links 
connected to nodes of degree one, which play little role in the connectivity of the network. 
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The simple almost-random strategy, on the opposite, focuses on those links which really 
disconnect the network. 

3.4 Conclusion on attacks. 

There are two main formal conclusions for this section. First, as expected from the em- 
pirical results discussed in introduction, power-law networks are very sensitive to classical 
attacks, much more than Poisson networks. Second, the link point of view shows that 
many links are actually removed when the thresholds for classical attacks are reached. 
Moreover, very simple attack strategies close to random node or link failures also lead to 
finite (and reasonably small) thresholds. 

Altogether, these results make it possible to discuss precisely the efficiency of classical 
attacks. First, although the number of links removed during such attacks is huge, this 
is not sufficient to explain the collapse of the network. Indeed the removal of the same 
number of links at random does not collapse the network. Second, the number of removed 
links during classical attacks in a Poisson network and in a power-law network are very 
similar. This moderates the conclusion that power-law networks are particularly sensitive 
to classical attacks, since in terms of links both are equally robust. 

Finally, the attack strategies we introduced, which are very close to random failures, 
show that the efficiency of classical attacks relies strongly on simple properties like re- 
moving nodes of degree larger than 1 and links between nodes of degree at least 2. 

4 Resilience of real- world networks. 

So far we have presented theoretical results as well as experiments on failures and attacks 
for different types of random networks. We have seen that, apart from the general shape 
of their degree distributions, precise properties of the networks under concern, like for 
instance their fraction of nodes of degree 1, may play a crucial role in their behavior in 
case of failures or attacks. Other properties not captured by the models, like clustering for 
instance, may also play an important role. In order to give some insight on the practical 
incidence of the results above, we present in this section empirical results on real-world 
complex networks. 

We will consider the following real-world cases, which are representative of the ones 
considered in studies of this field. The actor network is composed of movie actors which 
are linked together if they played in the same movie. It is obtained from the Internet 
Movie DataBase [68]. See [1201 [9] for results on this network. The co-authoring network 
is composed scientific authors, two authors being linked together if they signed a paper 
(present in the archive) together. It is obtained from the arXiv site [8] See [92l [93] for 
results on this kind of networks. The cooccurrence network is composed of words of the 
Bible [15], two words being linked if they belong to the same sentence. See [51] for results 



55 



network 


^ nodes 


links 


avg. deg. 




3Q9 341 


1 5 038 083 
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co-authoring 


16402 


29 552 
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cooccurrence 


9 297 


392 066 


84.342 


internet 1 


228 263 


320 149 


2.805 


internet-core 


75 885 


357317 


9.417 


protein 


2115 


4480 


4.236 


www 


325 729 


1497135 


9.192 


p2p 


159 541 


17454369 


218.807 



Table 9: Number of links, nodes and average degree of the real- world networks under 
study. 

on this kind of networks. The internetl and the internet- core networks are two internet 
maps where the nodes are routers in the internet, two routers being linked if they are at 
one hop at the IP level. These networks are described and studied in [59l [62] . The protein 
network is composed of proteins, two proteins being linked if they interact together in 
a cell. It is obtained from [TTj . See [HS] for results on this kind of networks. The www 
network is composed of web pages, two pages being linked together if there is a hyperlink 
from one of them to the other. This sample is obtained from [TTJ. See [6] for results 
on this kind of networks. Finally, the p2p network is a set of exchanges between peers, 
captured in a running peer-to-peer system, two peers being linked if they exchanged a 
file during the capture. See [631 E3] f° r precise description and results on this network. 
We give in Table [9] the number of nodes and links of these networks, together with their 
average degree. More detailed description of these data can be found in the references. 

We will discuss these networks' resilience to failures and attacks and will explain it 
using our knowledge of their structure and the results presented in this paper. Note that 
these networks are obtained through intricate measurement procedures. It is known that 
the obtained views of the networks cannot always be considered as representative of the 
original networks, see for instance [T6"]lll6j in the general case and [62 | HU[. [27 ] [26l [2T| [T| [T5] 
in the case of the internet. However, our goal is not to address such issues and we do 
not claim to give exact results on these networks. Our goal is rather to illustrate our 
approach. 

Figures [12] to [2H] show the obtained plots, together with the actual degree distribution 
of the network. We also present in Figures [T7] and [TS] the behavior of power-law random 
networks in case of failures and attacks, in order to compare it to the behavior of our 
real-world networks. We chose the two types of power-law networks that yielded the 
most different behaviors, namely continuous power-law networks with exponent 2.5 and 
discrete power-law networks with exponent 3, in order to span all existing behaviors for 
these networks. 
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Figure 17: Behavior of continuous power-law networks with exponent 2.5 in case of failures 
and attacks. 
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Figure 18: Behavior of discrete power-law networks with exponent 3 in case of failures 
and attacks. 



57 




1 10 100 1000 10000 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 

Degree Fraction nodes removed Fraclion links removed 



Figure 19: actor network. From left to right: degree distribution, node removals and link 
removals. 
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Figure 20: co- authoring network. From left to right: degree distribution, node removals 
and link removals. 
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Figure 21: coccurrence network. From left to right: degree distribution, node removals 
and link removals. 



First notice that none of these networks has a Poisson structure, though their degree 
distribution sometimes is quite far from a power-law. In all the cases the degree distribu- 
tion is highly heterogeneous, which makes the discussed attack strategies relevant. Notice 
however that some networks (in particular, the actor, cooccurrence, internet-core and p2p 
networks) have a significantly lower fraction of nodes of degree one than what can be 
expected for power-law networks. 

It appears that in all the cases the behavior for random node failures and classical 
attacks matches the theoretical expectations: there is a significant difference between 
random node failures and classical attacks, even if this difference is quite small in the 
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Figure 22: internetl network. From left to right: degree distribution, node removals and 
link removals. 
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Figure 23: internet-core network. From left to right: degree distribution, node removals 
and link removals. 
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Figure 24: protein network. From left to right: degree distribution, node removals and 
link removals. 



cases of actor and p2p networks. The same is true for almost-random node attacks: in 
most cases, they are an intermediate between random nodes failures and classical attacks. 
Notice however that in several cases, namely actor, cooccurrence, internet-core and p2p, 
the behaviors in the case of random node failures and almost-random node attacks are 
very similar. This is due to the fact that, as pointed out above, there are few nodes of 
degree 1 with respect to the total number of nodes. 

Concerning link removals, we observe two different types of behaviors. The co- 
authoring, internetl, protein, and www networks more or less conform to the expected 
behavior. Notice however that the www network is very resilient to all link removal 



59 




Degree Fraction nodes removed Fraclion links removed 



Figure 25: www network. From left to right: degree distribution, node removals and link 
removals. 
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Figure 26: p2p network. From left to right: degree distribution, node removals and link 
removals. 



strategies. We will come back to this point later. 

The actor, cooccurrence, internet- core, and p2p networks, however, behave very differ- 
ently from what was expected. For these networks, almost all links have to be removed to 
simply reduce significantly the size of the largest connected component, for all strategies 
except node failures seen from the link point of view. This can be explained as follows. 
First, these networks have very strong average degrees: they are the four networks with 
the strongest average degree. Denser networks are naturally more resilient to link re- 
movals. Also, notice that these networks have a relatively low fraction of nodes of degree 
one, compared to the other networks. This naturally induces that a very large fraction of 
the links are attached to the highest degree nodes. The removal of a few of these nodes 
does not have a strong impact on the network, however this induces the removal of most 
of the links, causing the plot for classical attacks seen from the link point of view to be 
almost horizontal in the beginning. The fact that there are few nodes of degree one also 
explains why random link failures and almost-random link attacks behave very similarly. 
Finally, counter-intuitively, the most efficient link removal strategy for these networks 
is node failures seen from the link point of view (though this strategy cannot be called 
efficient in itself, since all links must be removed to break down the networks): random 
node failures cause the size of the largest connected component to decrease linearly in this 
case (meaning that the largest connected component contains all nodes except the ones 
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actually removed by failures). We have seen in Proposition 12.111 that, when a fraction p 
of the nodes are removed by random failures, the corresponding fraction of removed links 
is 2p — p 2 . The size of the largest component as a function of the fraction of removed links 
is therefore expected to evolve as y/1 — x, which corresponds to the shape of the random 
node failures seen from the link point of view plot. 

It must be noticed finally that though the actor, cooccurrence, internet-core, and p2p 
networks behave similarly and have high average degrees, there is a large difference in 
the average degree of the internet-core network and the one of the three others. The 
average degree of the www network is in fact almost the same as the one of the internet- 
core network, but it behaves differently (though it is indeed very resilient to link removal 
strategies). This means that the average degree does not uniquely determine the behavior 
of a network, and that other properties are in cause. 

The behaviors of the two maps of the internet are very different, which shows that 
one must be very careful when deriving conclusions about such networks. Indeed, the 
measurement procedure only gives a partial and biased view, see EH EZl ESI ED EH ES] • 
This moderates the often claimed assertion that the internet is very robust to failures 
and very sensitive to attacks, which has been derived from such experiments, typically 
conducted on maps of the kind of internet 1. 

In this last case, one may notice that almost-random link attacks destroy the network 
more efficiently than classical attacks viewed from the link point of view. This indicates 
that the robustness of this network in case of failures is strongly due to the fact that the 
amount of nodes of degree 1 is huge. 

In conclusion, these experiments showed that, concerning random node failures and 
classical attacks, real-world networks behave accordingly to theoretical predictions. When 
taking into account almost-random node attacks, as well as link removal strategies, how- 
ever, more subtle behaviors occur. 

This shows that all the aspects we have discussed in this paper must be taken into 
account when dealing with practical cases. This also shows that much remains to be done 
to fully understand the observed phenomena; we will discuss this further in the conclusion. 
On the other hand, one may see the study of the resilience of a given network as a way to 
deepen the understanding of its structure and point out some non-trivial features which 
should be explored. This is the case, for instance, of the remarks we made on the www 
network above. 

5 Conclusion and discussion. 

In this contribution, we focused on a set of previously known results which received much 
attention in the last few years. These results state that, although power-law networks 
are very resilient to random (node or link) failures and Poisson ones are not, they are 
very sensitive to a special type of attacks (which we call classical attacks) consisting of 
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removing the highest degree nodes first, while Poisson networks are not. This had led to 
the conclusion that its power-law degree distribution may be seen as an Achille's heel of 
the internet [9T1 [TO] . 

These results were first obtained empirically (7J[20], but an important analytic effort 
has been made to prove them with mean-field and asymptotic approximations [301 SB EH 
|25| [96] using two different techniques. 

Our first contribution is to give a unified and complete presentation of these results 
(both empiric and analytic ones). Since some of the involved techniques (in particular 
mean-field approximations) are unusual in computer science, we emphasized on the meth- 
ods used and gave much more detailed proofs than in the original papers. In particular, 
we pointed out the approximations where they occur, discussed them in the light of the 
experiments, and tried generally to give a didactic presentation. 

Our second contribution is to introduce some new results on cases which received less 
attention, maybe because these results are less striking. They are however essential for 
deepening one's understanding of this topic. We focused in particular on two aspects: 
studying the finite case, and studying the link point of view of random node failures, and 
classical attacks. We also introduced new attacks, very similar to random failures, which 
allowed us to deepen our understanding of the phenomena at play. 

Finally, we conducted extensive simulations, on random graphs of different types in 
order to confront them to theoretical results, but also on real-world networks, which put 
in evidence complex behaviors, not attributable to the sole degree distribution. 

All this showed that many of the classical conclusions of the field should be discussed 
further. We may now put all these results and their relations together to derive global 
conclusions. 

Concerning random node and link failures (i.e. random node and link removals), the 
striking point is that, although analysis predict completely different behaviors for Poisson 
and power-law networks, in practice the differences, though important, are not huge (see 
Tables [2] to H]) . This is even more pronounced for link failures. This overestimation of 
the difference was due to the study of the infinite limit and to the approximations made. 
It may also be a consequence of our choice to consider that a network breakdown occurs 
when the size of the the giant component reaches 5% of all nodes, but other conventions 
lead to similar conclusions. 

Concerning classical attacks (i.e. removal of nodes in decreasing order of their degree), 
we have shown that, although the thresholds for power-law networks indeed are very low, 
and much smaller than the ones for Poisson networks, our other observations tend to 
moderate this conclusion. Indeed, as one may have guessed, the number of links removed 
during a classical attack is huge. When one considers the number of removed links, 
power-law networks are not more fragile than Poisson ones. 

The large number of removed links, though it clearly plays a role, is however not 
sufficient to explain the efficiency of classical attacks: if one removes the same fraction of 
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links randomly, then there is no breakdown. This invalidates the often claimed explanation 
that classical attacks are very efficient on power-law networks because they remove many 
links. 

Going further, if one removes the same, or even a smaller, fraction of links, but almost 
randomly (i.e. randomly among the ones which are linked to nodes of degree at least 2) 
then a breakdown occurs. In terms of the fraction of removed links, classical attacks 
therefore lie between random link failures and almost-random link attacks, which makes 
them not so efficient. 

Finally, the efficiency of classical attacks resides mainly in the fact that it removes 
many links, and that these links are mostly attached to nodes of degree larger than 1. 
Conversely, this explains the robustness of power-law networks to random node failures: 
such failures often remove nodes of degree 1 and/or links attached to such nodes. 

Another conclusion of interest comes from the study of classical attacks on Poisson 
networks (which was not done in depth until now). Although these networks behave 
similarly in case of random node failures and classical attacks, it must be noted that their 
threshold is significantly lower in the second case. This goes against the often claimed 
assumption that, because all nodes have almost the same degree in a Poisson network, 
there is little difference between random node failures and classical attacks. This is worth 
noticing, since it reduces the difference, often emphasized, in the behavior of Poisson and 
power-law networks. 

The observation of practical cases in Section 0] also provided interesting insights: in 
several cases, some observed behaviors may be explained using the results in this paper and 
our knowledge of the properties of the underlying network. It appears clearly however that 
other properties than degree distributions play an important role on network resilience. 
This points out interesting directions for further analysis. Conversely, this shows that one 
may see the study of a particular network's resilience as a way to obtain some insight on 
its structure. 

All these results led us to the conclusion that, although random node failures and 
classical attacks clearly behave differently and though the Poisson or power-law nature of 
the network has a strong influence in this, one should be careful in deriving conclusions. 
This is confirmed by our experiments on real-world networks. The sensitivity of networks 
to attacks relies less on the presence of high-degree nodes than on the fact that they have 
many low-degree nodes. Conversely, their robustness to failures relies strongly on the fact 
that when we choose a node at random, we choose such a node with high probability, and 
not so much on the fact that high-degree nodes hold the network together. Moreover, the 
fact that a classical attack on a power-law network removes many links may be considered 
as partly, but not fully, responsible for its rapid breakdown. 

Although this paper is already quite long, we had to make some choices in the pre- 
sented results, and there are of course many omissions. For instance, we did not mention 
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random networks with degree correlations, on which interesting results exist [T71 1119j . or 
other types of modeling, for instance the HOT framework [161 [82] . We also ignored the 
various contributions considering other attack strategies [80j EH EH I100L l22l 11011 l67l l52l 
1901 f89l 1881 11081 1124j and other definitions of the robustness than the size of the giant com- 
ponent [TBI ETJ 11011 1371 [821 HE). Likewise, we could have compared real-world networks 
in Section H] with random networks having exactly the same degree distribution, which 
would certainly be enlightening. It would be interesting also to compare more precisely 
the results obtained with other practical definitions of a network breakdown. 

Presenting and discussing all these aspects is impossible in a reasonable space; instead, 
we chose to deepen the basic results of the field, which we hope leads to a significantly 
improvement of our understanding of them. This work could serve as a basis for further 
deepening of some aspects we have voluntary omitted, such as the ones pointed out above. 

Going further, it must be clear that other properties than the degree distribution, 
which should be investigated, certainly play a role in real-world complex networks' ro- 
bustness. This appears clearly in Section [U where several classes of behaviors appear. 

It seems obvious, for instance, that the fact that nodes of real-world complex networks 
are organized in communities (groups of densely connected nodes), which is also captured 
in part by the notion of clustering coefficient [HI [57J [7U [531 EH Ell EH [77] , plays a key 
role. However, these properties are not captured by the random networks we considered 
here. 

Studying robustness of networks with more subtle properties than degree distribu- 
tions would therefore be highly relevant, but most remains to be done. In particular, 
while there is a consensus on the modeling of networks with a given degree distribution in 
the community (through the use of the configuration model [12], like here, or the prefer- 
ential attachment principle [1]), there is no consensus for more subtle properties like the 
clustering. Many models have been proposed, but each has its own advantages and draw- 
backs. Some of them seem however well suited for analysis, as they are simple extensions 
of the configuration model [61, 60J or of the preferential attachment one [4~4"1 |4"5]. 

Finally, let us insist once more on the necessity of developing formal results to enhance 
our understanding of empiric results. It makes no doubt that experiments (in our case first 
obtained in [3 ED]) bring much understanding and intuition on phenomena of interest. The 
need for rigor and for a deeper understanding of what happens during these experiments 
is however strong. It led to several approaches to analyze them. The main ones in our 
context are developed in [301 EE1 EH [25], [96] . They all rely on mean-field approximations, 
and we gave here the details of the underlying approximations and assumptions. Such 
an approach is definitively rigorous, but is not formal. Obtaining exact results, or even 
approximate results, with formal methods would be another improvement. Some results 
begin to appear in this direction [19J, but much remains to be done and the task is 
challenging. 
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